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' EXPRESS MAIL CERTIFICATE 
"EXPRESS MAIL" MAILING LABEL NUMBER EL229502579US 
DATE OF DEPOSIT 23 March 2000 

Attorney Docket No. B451 10 

IN THE UNITED STATES PATENT AND TRADEMARK OFFICE 
Applicant: Brack, et al. 23 March 2000 

International App. No.: PCT/EP9 8/06040 Group Art Unit No.: Unknown 

International Filing Date: 17 September 1998 Examiner: Unknown 

For: FUSION PROTEINS COMPRISING HIV-1 TAT AND/OR NEF PROTEINS 

Assistant Commissioner of Patents 
Box: PCT 

Washington, D.C. 20231 

PRELIMINARY AMENDMENT 

Preliminary to the examination of this application, applicants respectfully request 
amendment of the above-identified application as follows: 

IN THE CLAIMS : 

Please delete claims 1-31. 

Please add new claims 32-77. 
32. A vaccine composition which comprises a protein comprising 

(a) an HIV Tat protein or derivative thereof linked to either (i) a fusion partner or 
(ii) an HIV Nef protein or derivative thereof; or 

(b) an HIV Nef protein or derivative thereof linked to either (i) a fusion partner or (ii) 
an HIV Tat protein or derivative thereof; or 

(c) an HIV Nef protein or derivative thereof linked to an HIV Tat protein or 
derivative thereof and a fusion partner, 

in admixture with a pharmaceutically acceptable excipient. 



33. A composition as claimed in claim 32, comprising a Tat-Nef fusion protein or derivative 
thereof. 



-2- 

Intl. App. No.: PCT/EP98/06040 
Docket No. B45110 

34. A composition as claimed in claim 32, comprising a Nef-Tat fusion protein or derivative 
thereof. 

35. A composition according to claim 32, wherein the derivative of the Tat protein is a 
mutated Tat protein. 

36. A composition according to claim 32, wherein the derivative of the Nef protein is a 
mutated Nef protein. 

37. A composition as claimed in claim 32, wherein the fusion partner is a lipoprotein or 
derivative thereof. 

38. A composition as claimed in claim 37, wherein the lipoprotein is Haemophilus Influenza 
B protein D or derivative thereof. 

39. A composition as claimed in claim 38, wherein the fusion partner comprises between 
100-130 amino acid from the N terminal of Haemophilus Influenza B protein D. 

40. A composition as claimed in claim 32, wherein the Tat protein is the entire Tat protein. 

41 . A composition as claimed in claim 32, wherein the Nef protein is the entire Nef protein. 

42. A composition as claimed in claim 32, wherein the Tat protein is fused to an HIV Nef 
protein and a fusion partner. 

43. A composition as claimed in claim 32, wherein the protein has a Histidine tail. 

44. A composition as claimed in claim 32, wherein the protein is carboxymethylated. 

45. A composition as claimed in claim 32, additionally comprising an adjuvant. 

46. A composition as claimed in claim 45, wherein the adjuvant is a TH1 inducing adjuvant. 
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47. A composition as claimed in claim 45 which adjuvant comprises monophosphoryl lipid 
A or a derivative thereof such as 3 de-O-acylated monophosphoryl lipid A. 

48. A composition as claimed in claim 45, additionally comprising a saponin adjuvant. 

49. A composition as claimed in any one of claims 45 to 48 which additionally comprises an 
oil in water emulsion. 

50. A composition as claimed in claim 32 further comprising HIV gpl60 or its derivative 
gpl20. 

51. A composition as claimed in claim 45 further comprising HIV gp 1 60 or its derivative 
gpl20. 

52. A composition as claimed in claim 48 further comprising HIV gpl60 or its derivative 
gpl20. 

53. A composition as claimed in claim 49 further comprising HIV gpl60 or its derivative 
gpl20. 

54. A protein comprising an HIV Tat protein or derivative thereof linked to an HIV Nef 
protein or derivative thereof in Nef-Tat or Tat-Nef orientation. 

55. A nucleic acid encoding a protein of claim 54. 

56. A host transformed with a nucleic acid of claim 55. 

57. A host as claimed in claim 56 wherein the host is either E. coli or Pichia pastoris. 



58. A method of producing a protein of claim 54, comprising providing a host as claimed in 
claim 56 or 57, expressing said protein and recovering the protein. 
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59. A method of preparing (i) an HIV Nef protein or derivative thereof or (ii) an HIV Tat 
protein or derivative thereof in Pichia pastoris which method comprises the steps of 
transforming Pichia pastoris with DNA encoding said HIV Nef protein or derivative 
thereof of HIV Tat protein or derivative thereof, expressing said protein and recovering 
the protein. 

60. The method of claim 58 further comprising a carboxymethylation step performed on the 
expressed protein. 

61 . The method of claim 59 further comprising a carboxymethylation step performed on the 
expressed protein. 

62. A method of producing a vaccine, comprising admixing the protein from claim 58 with a 
pharmaceutically acceptable diluent, 

63. A method of producing a vaccine, comprising admixing the protein from claim 59 with a 
pharmaceutically acceptable diluent. 

64. A method of producing a vaccine, comprising admixing the protein from claim 60 with a 
pharmaceutically acceptable diluent. 

65. The method of claim 62 further comprising the addition of HIV gpl60 or its derivative 
gpl20. 

66. The method of claim 63 further comprising the addition of HIV gpl60 or its derivative 
gpl20. 

67. The method of claim 64 further comprising the addition of HIV gpl60 or its derivative 
gpl20. 
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68. The method of claim 58 further comprising the addition of an adjuvant, particularly a 
TH1 inducing adjuvant. 



69. The method of claim 59 further comprising the addition of an adjuvant, particularly a 
TH1 inducing adjuvant. 

70. The method of claim 60 further comprising the addition of an adjuvant, particularly a 
TH1 inducing adjuvant. 

71. The method of claim 61 further comprising the addition of an adjuvant, particularly a 
TH1 inducing adjuvant. 

72. The method of claim 62 further comprising the addition of an adjuvant, particularly a 
TH1 inducing adjuvant. 

73. The method of claim 63 further comprising the addition of an adjuvant, particularly a 
TH1 inducing adjuvant. 

74. The method of claim 64 further comprising the addition of an adjuvant, particularly a 
TH1 inducing adjuvant. 

75. The method of claim 65 further comprising the addition of an adjuvant, particularly a 
TH1 inducing adjuvant. 

76. A vaccine composition comprising a recombinant Tat-containing protein formulated with 
a mixture of 3D-MPL, QS21 and an oil in water emulsion. 

77. A composition as claimed in claim 76 wherein the oil in water emulsion comprises 
squalene, polyoxyethylene sorbitan monooleate and a-tocopherol. 
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REMARKS 

The above-identified application is being entered into the National Phase from PCT 
application no. PCT/EP98/06040. 

Applicants have deleted claims 1-31 and added new claims 32-77 to put the claims in 
conformity with U.S. practice. 

No new matter has been introduced. 



Respectfully submitted, 




Zoltan Kerekes 
Attorney for Applicants 
Registration No. 38,938 
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Corporate Intellectual Property - UW2220 
P.O. Box 1539 

King of Prussia, PA 19406-0939 
Phone (610) 270-5024 
Facsimile (610) 270-5090 
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FUSION PROTEINS COMPRISING HIV-1 TAT AND/OR NEF PROTEINS 

The present invention relates to novel HIV protein constructs, to their use in medicine, 
5 to pharmaceutical compositions containing them and to methods of their manufacture. 

In particular, the invention relates to fusion proteins comprising HIV-1 Tat and/or Nef 
proteins. 



10 HIV-1 is the primary cause of the acquired immune deficiency syndrome (AIDS) 
which is regarded as one of the world's major health problems. Although extensive 
research throughout the world, has been conducted to produce a vaccine, such efforts 
thus far, have not been successful. 

1 5 Non-envelope proteins of HTV-1 have been described and include for example internal 
structural proteins such as the products of the gag and pol genes and, other non- 
structural proteins such as Rev, Nef, Vif and Tat (Greene et al. 9 New England J. Med, 
324, 5, 308 et seq (1991) and Bryant et al. (Ed. Pizzo), Pediatr. Infect. Dis. J. ? 11, 5, 
390 etseq (1992). 

20 

HIV Nef and Tat proteins are early proteins, that is, they are expressed early in 
infection and in the absence of structural proteins. 

According to the present invention there is provided a protein comprising 
25 (a) an HIV Nef protein or derivative thereof linked to either (i) a fusion partner or 

(ii) an HIV Tat protein or derivative thereof; or 

(b) an HIV Tat protein or derivative thereof linked to either (i) a fusion partner or 
(ii) an HIV Nef protein or derivative thereof; or 

(c) an HIV Nef protein or derivative thereof linked to an HIV Tat protein or 
30 derivative thereof and a fusion partner. 

By 'fusion partner' is meant any protein sequence that is not Tat or Nef. 
Preferably the fusion partner is protein D or its' lipidated derivative Lipoprotein D, 
from HaemopMius influenzae B. In particular, it is preferred that the N-terminal 
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third, i.e. approximately the first 100-130 amino acids are utilised. This is represented 
herein as Lipo D 1/3. In a preferred embodiment of the invention the Nef protein or 
derivative thereof may be linked to the Tat protein or derivative thereof. Such Nef- 
Tat fusions may optionally also be linked to an fusion partner, such as protein D. 

5 

The fusion partner is normally linked to the N-terminus of the Nef or Tat protein. 

Derivatives encompassed within the present invention include molecules with a C 
terminal Histidine tail which preferably comprises between 5-10 Histidine residues. 
10 Generally, a histidine tail containing n residues is represented herein as His (n). The 
presence of an histidine (or 'His') tail aids purification. More specifically, the 
invention provides proteins with the following structure 



15 



Lipo D 1/3 



Nef 



His ( 6 ) 



Lipo D 1/3 



Nef-Tat 



His ( 6 ) 



ProtDl/3 - Nef - His ( 6 ) 

20 ProtDl/3 - Nef-Tat - His ( 6 ) 

Nef-Tat - His ( 6 ) 



Figure 1 provides the amino-acid (Seq. ID. No. 7) and DNA sequence (Seq. ID. No. 6) 
25 of the fusion partner for such constructs. 

In a preferred embodiment the proteins are expressed with a Histidine tail comprising 
between 5 to 10 and preferably six Histidine residues. These are advantageous in 
aiding purification. Separate expression, in yeast (Saccharomyces cerevisiae), of Nef 
30 (Macreadie LG. et aL, 1993, Yeast 9 (6) 565-573) and Tat (Braddock M et al., 1989, 
Cell 58 (2) 269-79) has already been reported. Nef protein only is myristilated. The 
present invention provides for the first time the expression of Nef and Tat separately 
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in a Pichia expression system (Nef-His and Tat-His constructs), and the successful 
expression of a fusion construct Nef-Tat-His. The DNA and amino acid sequences of 
representative Nef-His (Seq. ID. No.s 8 and 9), Tat-His (Seq. ID. No.s 10 and 1 i)and 
of Nef-Tat-His fusion proteins (Seq. ID. No.s 12 and 13)are set forth in Figure 2. 

5 

Derivatives encompassed within the present invention also include mutated proteins. 
The term 'mutated' is used herein to mean a molecule which has undergone deletion, 
addition or substitution of one or more amino acids using well known techniques for 
site directed mutagenesis or any other conventional method. 

10 

A mutated Tat is illustrated in Figure 2 (Seq. ID. No.s 22 and 23) as is a Nef-Tat 
Mutant-His (Seq. ID. No.s 24 and 25). 

The present invention also provides a DNA encoding the proteins of the present 
1 5 invention. Such sequences can be inserted into a suitable expression vector and 
expressed in a suitable host. 

A DNA sequence encoding the proteins of the present invention can be synthesized 
using standard DNA synthesis techniques, such as by enzymatic ligation as described 
20 by D.M. Roberts et al in Biochemistry 1985, 24, 5090-5098, by chemical synthesis, 
by in vitro enzymatic polymerization, or by PCR technology utilising for example a 
heat stable polymerase, or by a combination of these techniques. 

Enzymatic polymerisation of DNA may be carried out in vitro using a DNA 
25 polymerase such as DNA polymerase I (Klenow fragment) in an appropriate buffer 
containing the nucleoside triphosphates dATP, dCTP, dGTP and dTTP as required at 
a temperature of 10°-37°C, generally in a volume of 50jil or less. Enzymatic ligation 
of DNA fragments may be carried out using a DNA ligase such as T4 DNA ligase in 
an appropriate buffer, such as 0.05M Tris (pH 7.4), 0.01M MgCl 2 , 0.01M 
30 dithiothreitol, ImM spermidine, ImM ATP and O.lmg/ml bovine serum albumin, at a 
temperature of 4°C to ambient, generally in a volume of 50ml or less. The chemical 
synthesis of the DNA polymer or fragments may be carried out by conventional 
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phosphotriester, phosphite or phosphoramidite chemistry, using solid phase 
techniques such as those described in 4 Chemical and Enzymatic Synthesis of Gene 
Fragments - A Laboratory Manual' (ed. H.G. Gassen and A. Lang), Verlag Chemie, 
Weinheim (1982), or in other scientific publications, for example MJ. Gait, H.W.D. 

5 Matthes, M. Singh, B.S. Sproat, and R.C. Titmas, Nucleic Acids Research, 1982, 10, 
6243; B.S. Sproat, and W. Bannwarth, Tetrahedron Letters, 1983, 24, 5771; M.D. 
Matteucci and M.H. Caruthers, Tetrahedron Letters, 1980, 21, 719; M.D. Matteucci 
and M.H. Caruthers, Journal of the American Chemical Society, 1981, 103, 3185; S,P. 
Adams et aL, Journal of the American Chemical Society, 1983, 105, 661; N.D. Sinha, 

10 J. Biernat, J. McMannus, and H. Koester, Nucleic Acids Research, 1984, 12, 4539; 
and H.W.D. Matthes et aL, EMBO Journal, 1984, 3, 801. 



The invention also provides a process for preparing a protein of the invention, the 
1 5 process comprising the steps of : 



i) preparing a replicable or integrating expression vector 
capable, in a host cell, of expressing a DNA polymer 
comprising a nucleotide sequence that encodes the protein or 

20 a derivative thereof 

ii) transforming a host cell with said vector 

iii) culturing said transformed host cell under conditions 
permitting expression of said DNA polymer to produce said 
protein; and 

25 iv) recovering said protein 



The process of the invention may be performed by conventional recombinant 
techniques such as described in Maniatis et aL, Molecular Cloning - A Laboratory 
Manual; Cold Spring Harbor, 1982-1989. 



The term 'transforming' is used herein to mean the introduction of foreign DNA into 

host cell This can be achieved for example by transformation, transfection or 
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infection with an appropriate plasmid or viral vector using e.g. conventional 
techniques as described in Genetic Engineering; Eds. S.M. Kingsman and A. J. 
Kingsman; Blackwell Scientific Publications; Oxford, England, 1988. The term 
'transformed' or 'transformant' will hereafter apply to the resulting host cell 
5 containing and expressing the foreign gene of interest. 

The expression vectors are novel and also form part of the invention. 

The replicable expression vectors may be prepared in accordance with the invention, 
10 by cleaving a vector compatible with the host cell to provide a linear DNA segment 
having an intact replicon, and combining said linear segment with one or more DNA 
molecules which, together with said linear segment encode the desired product, such 
as the DNA polymer encoding the protein of the invention, or derivative thereof, 
under ligating conditions. 

15 

Thus, the DNA polymer may be preformed or formed during the construction of the 
vector, as desired. 

The choice of vector will be determined in part by the host cell, which may be 
20 prokaryotic or eukaryotic but preferably is E. colt or yeast. Suitable vectors include 
plasmids, bacteriophages, cosmids and recombinant viruses. 

The preparation of the replicable expression vector may be carried out conventionally 
with appropriate enzymes for restriction, polymerisation and ligation of the DNA, by 
25 procedures described in, for example, Maniatis et al cited above. 

The recombinant host cell is prepared, in accordance with the invention, by 
transforming a host cell with a replicable expression vector of the invention under 
transforming conditions. Suitable transforming conditions are conventional and are 
30 described in, for example, Maniatis et al cited above, or "DNA Cloning" Vol. II, 
D.M. Glover ed., IRL Press Ltd, 1985. 
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The choice of transforming conditions is determined by the host cell. Thus/a 
bacterial host such as E. coli may be treated with a solution of CaCl 2 (Cohen et al. 9 
Proc. Nat. Acad. Sci., 1973, 69, 2110) or with a solution comprising a mixture of 
RbCl, MnCU, potassium acetate and glycerol, and then with 3-[N-morpholino]- 
5 propane-sulphonic acid, RbCl and glycerol. Mammalian cells in culture may be 
transformed by calcium co-precipitation of the vector DNA onto the cells. The 
invention also extends to a host cell transformed with a replicable expression vector of 
the invention. 

10 Culturing the transformed host cell under conditions permitting expression of the 

DNA polymer is carried out conventionally, as described in, for example, Maniatis et 
al and "DNA Cloning" cited above. Thus, preferably the cell is supplied with 
nutrient and cultured at a temperature below 50°C. 

1 5 The product is recovered by conventional methods according to the host cell. Thus, 
where the host cell is bacterial, such as E. coli - or yeast such as Pichia; it may be 
lysed physically, chemically or enzymatically and the protein product isolated from 
the resulting lysate. Where the host cell is mammalian, the product may generally be 
isolated from the nutrient medium or from cell free extracts. Conventional protein 

20 isolation techniques include selective precipitation, adsorption chromatography, and 
affinity chromatography including a monoclonal antibody affinity column. 

For proteins of the present invention provided with Histidine tails, purification can 
easily be achieved by the use of a metal ion affinity column. In a preferred 
25 embodiment, the protein is further purified by subjecting it to cation ion exchange 
chromatography and/or Gel filtration chromatography. The protein is then sterilised 
by passing through a 0.22 jum membrane. 

The proteins of the invention can then be formulated as a vaccine, or the Histidine 
30 residues enzymatically cleared. 
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The proteins of the present invention are provided preferably at least 80% pure more 
preferably 90% pure as visualised by SDS PAGE. Preferably the proteins appear as a 
single band by SDS PAGE. 

5 The present invention also provides pharmaceutical composition comprising a protein 
of the present invention in a pharmaceutical^ acceptable excipient 

Vaccine preparation is generally described in New Trends and Developments in 
Vaccines, Voller et aL (eds.). University Park Press, Baltimore, Maryland, 1978. 
10 Encapsulation within liposomes is described by Fullerton, US Patent 4,235,877. 

The proteins of the present invention are preferably adjuvanted in the vaccine 
formulation of the invention. Suitable adjuvants include an aluminium salt such as 
aluminium hydroxide gel (alum) or aluminium phosphate, but may also be a salt of 
1 5 calcium, iron or zinc, or may be an insoluble suspension of acylated tyrosine, or 
acylated sugars, cationically or anionically derivatised polysaccharides, or 
polyphosphazenes. 

In the formulation of the inventions it is preferred that the adjuvant composition 
20 induces a preferential TH1 response. Suitable adjuvant systems include, for example, 
a combination of monophosphoryl lipid A or derivative thereof, preferably 3-de-O- 
acylated monophosphoryl lipid A (3D-MPL) together with an aluminium salt. 

An enhanced system involves the combination of a monophosphoryl lipid A and a 
25 saponin derivative particularly the combination of QS21 and 3D- MPL as disclosed in 
WO 94/00153, or a less reactogenic composition where the QS21 is quenched with 
cholesterol as disclosed in WO 96/33739. 

A particularly potent adjuvant formulation involving QS21, 3D-MPL & tocopherol in 
30 an oil in water emulsion is described in WO 95/17210 and is a preferred formulation. 
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Accordingly in one embodiment of the present invention there is provided a vaccine 
comprising a protein according to the invention adjuvanted with a monophosphoryl 
lipid A or derivative thereof, especially 3D-MPL. 

5 Preferably the vaccine additionally comprises a saponin, more preferably QS2L 

Preferably the formulation additional comprises an oil in water emulsion and 
tocopherol. The present invention also provides a method for producing a vaccine 
formulation comprising mixing a protein of the present invention together with a 
1 0 pharmaceutical^ acceptable excipient, such as 3D-MPL. 

The vaccine of the present invention may additional comprise further HIV proteins, 
such as the envelope glycoprotein gpl60 or its derivative gp 120. 

15 In another aspect, the invention relates to an HIV Nef or an HIV Tat protein or 
derivative thereof expressed in Pichia pastor is. 

The invention will be further described by reference to the following examples: 
20 EXAMPLES: 
General 

Nef and Tat proteins, two regulatory proteins encoded by the human 
25 immunodeficiency virus (HIV-1) were produced in E.coli and in the methylotrophic 
yeast Pichia past or is. 

The nef gene from the Bru/Lai isolate (Cell 40: 9-17, 1985) was selected for these 
constructs since this gene is among those that are most closely related to the 
30 consensus Nef . 
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The starting material for the Bru/Lai nef gene was a 1 170bp DNA fragment cloned on 
the mammalian expression vector pcDNA3 (pcDNA3/«e/). 

The tat gene originates from the BH10 molecular clone. This gene was received as an 
5 HTLV III cDNA clone named pCVl and described in Science, 229, p69-73, 1985. 

1. EXPRESSION OF HIV-1 nef AND tat SEQUENCES IN E.COLI. 

Sequences encoding the Nef protein as well as a fusion of nef and tat sequences were 
1 0 placed in plasmids vectors: pRIT14586 and pRIT14589 (see figure 1). 

Nef and the Nef-Tat fusion were produced as fusion proteins using as fusion partner a 
part of the protein D. Protein D is an immunoglobulin D binding protein exposed at 
the surface of the gram-negative bacterium Haemophilus influenzae, 

15 

pRIT14586 contains, under the control of a APL promoter, a DNA sequence derived 
from the bacterium Haemophilus influenzae which codes for the first 127 amino acids 
of the protein D (Infect. Immun. 60 : 1336-1342, 1992), immediately followed by a 
multiple cloning site region plus a DNA sequence coding for one glycine, 6 histidines 
20 residues and a stop codon (Fig. 1 A). 

This vector is designed to express a processed lipidated His tailed fusion protein 
(LipoD fusion protein). The fusion protein is synthesised as a precursor with an 18 
amino acid residues long signal sequence and after processing, the cysteine at position 
25 19 in the precursor molecule becomes the amino terminal residue which is then 
modified by covalently bound fatty acids (Fig. IB). 

pRIT14589 is almost identical to pRIT14586 except that the protD derived sequence 
starts immediately after the cysteine 19 codon. 
30 Expression from this vector results in a His tailed, non lipidated fusion protein 
(Prot D fusion protein). 
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Four constructs were made: LipoD-rce/-His, 
ProtD-He/-ta/~His. 



PCT/EP98/06040 

LipoD-He/-ta/-His, ProtD-we/His, and 



The first two constructs were made using the expression vector pRIT 14586, the last 
5 two constructs used pRIT 14589. 

1.1 CONSTRUCTION OF THE RECOMBINANT STRAIN ECLD-N1 
PRODUCING THE LIPOD-Nef-HIS FUSION PROTEIN. 

10 

1.1.1 Construction of the lipoD-we^His expression plasmid pRIT14595 

The nef gene(Bru/Lai isolate) was amplified by PGR from pcDNA3/Nef plasmid with 
primers 01 and 02. 

15 

Ncol 

PRIMER 01 (Seq ID NO 1): 5'ATCGT CCATG.G GT.GGC.AAG.TGG.T V 

20 Spel 

PRIMER 02 (Seq ID NO 2): 5' CGGCT ACTAGT GCAGTTCTTGAA 3' 

The nefDNA region amplified starts at nucleotide 8357 and terminates at nucleotide 
8971 (CeU,40?9-17 s 1985). 

25 

An Ncol restriction site ( which carries the ATG codon of the nef gene) was 
introduced at the 5' end of the PCR fragment while a Spel site was introduced at the 3' 
end. 

30 The PCR fragment obtained and the expression plasmid pRIT14586 were both 

restricted by Ncol and Spel, purified on an agarose gel, ligated and transformed in the 
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appropriate E.coli host cell, strain AR58This strain is a cryptic X lysogen derived 
from N99 that is ga/E::TnlO, A-8 (chlD-pgl), A-Hl (cro-chlA\ N\ and cI857. 

The resulting recombinant plasmid received, after verification of the nef amplified 
5 region by automatic sequencing,(see section LL2 below) the pRIT14595 
denomination. 

1.1.2 Selection of transformants of E. Coli strain AR58 with pRIT14595 

10 

When transformed in AR58 E.coli host strain, the recombinant plasmid directs the 
heat-inducible production of the heterologous protein. 

Heat inducible protein production of several recombinant lipoD-Nef-His 
1 5 transformants was analysed by Coomassie Blue stained SDS-PAGE. All the 

transformants analysed showed an heat inducible heterologous protein production. 
The abundance of the recombinant Lipo D-Nef-Tat-His fusion protein was estimated 
at 10% of total protein. 

20 One of the transformants was selected and given the laboratory accession number 
ECLD-N1. 

The recombinant plasmid was reisolated from strain ECLD-N1, and the sequence of 
the nef-His coding region was confirmed by automated sequencing This plasmid 
25 received the official designation pRIT14595. 

The fully processed and acylated recombinant Lipo D-nef-His fusion protein produced 
by strain ECLD-N1 is composed of: 

30 °Fatty acids 

°109 a.a. of proteinD (starting at a.a.19 and extending to a.a.127). 
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°A methionine, created by the use of Ncol cloning site of pRIT14586 
(Fig.1). 

°205a.a. of Nef protein (starting at a.a.2 and extending to a.a.206). 
°A threonine and a serine created by the cloning procedure (cloning at Spel 
5 site of pRIT14586). 

°One glycine and six histidines. 



1 .2 CONSTRUCTION OF RECOMBINANT STRAIN ECD-N1 
PRODUCING PROT D-Nef-HIS FUSION PROTEIN. 

10 

Construction of expression plasmid pRIT 14600 encoding the Prot D-Nef-His fusion 
protein was identical to the plasmid construction described in example 1.1.1 with the 
exception that pRIT14589 was used as receptor plasmid for the PCR amplified nef 
fragment. 

15 

E.coli AR58 strain was transformed with pRIT 14600 and transformants were analysed 
as described in example 1.1.2. The transformant selected received laboratory 
accession number ECD-N1 . 
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13 CONSTRUCTION OF RECOMBINANT STRAIN ECLD-NT6 
PRODUCING THE LIPO D-Nef-Tat-HIS FUSION PROTEIN. 

1.3.1 Construction of the lipo D-Nef-Tat-His expression plasmid pRIT14596 

5 

The tat gene(BH10 isolate) was amplified by PCR from a derivative of the pCVl 
plasmid with primers 03 and 04, Spel restriction sites were introduced at both ends of 
the PCR fragment. 

10 Spel 

PRIMER 03 (Seq ID NO 3): 5' ATCGT ACTAGT. GAG.CCA.GTA.GAT.C 3* 

Spel 

PRIMER 04 (Seq ID NO 4): 5 5 CGGC TACTAGT TTCCTTCGGGCCT 3' 

15 

The nucleotide sequence of the amplified tat gene is illustrated in the pCVl clone 
(Science 229 : 69-73, 1985) and covers nucleotide 5414 till nucleotide 7998. 

20 The PCR fragment obtained and the plasmid pRIT14595 (expressing lipoD-Nef-His 
protein) were both digested by Spel restriction enzyme, purified on an agarose gel, 
ligated and transformed in competent AR58 cells. The resulting recombinant plasmid 
received, after verification of the tat amplified sequence by automatic sequencing (see 
section 1.3.2 below), the pRIT14596 denomination. 



25 



1.3.2 Selection of transformants of strain AR58 with pRIT14596 



Transformants were grown, heat induced and their proteins were analysed by 
Coomassie Blue stained gels. The production level of the recombinant protein was 
30 estimated at 1% of total protein. One recombinant strain was selected and received 
the laboratory denomination ECLD-NT6. 
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The lipoD-«<?/ tat -His recombinant plasmid was reisolated from ECLD-NT6 strain, 
sequenced and received the official designation pWT14596. 

The fully processed and acylated recombinant Lipo D-Nef-Tat-His fusion protein 
5 produced by strain ECLD-N6 is composed of: 

°Fatty acids 

°109 a.a. of proteinD (starting at a.a.19 and extending to a.a.127). 
°A methionine, created by the use of Ncol cloning site of pRIT14586. 
1 0 °205a.a. of the Nef protein (starting at a.a.2 and extending to a.a.206) 

°A threonine and a serine created by the cloning procedure 
°85a.a. of the Tat protein (starting at a.a.2 and extending to a.a.86) 
°A threonine and a serine introduced by cloning procedure 
°One glycine and six histidines. 

15 

1.4 CONSTRUCTION OF RECOMBINANT STRAIN ECD-NT1 
PRODUCING PROT D-Nef-Tat-HIS FUSION PROTEIN. 

Construction of expression plasmid pRIT 14601 encoding the Prot D-Nef-Tat-His 
20 fusion protein was identical to the plasmid construction described in example 13.1 
with the exception that pRIT14600 was used as receptor plasmid for the PCR 
amplified ne/fragment. 

Kcoli AR58 strain was transformed with pRIT14601 and transformants were analysed 
25 as described previously. The transformant selected received laboratory accession 
number ECD-NT1. 



30 



- 14- 



WO 99/16884 PCT/EP98/06040 

2. EXPRESSION OF HIV-1 nef AND tat SEQUENCES IN PICHIA 
PASTORIS. 

Nef protein. Tat protein and the fusion Nef -Tat were expressed in the methylotrophic 
5 yeast Pichia pastor is under the control of the inducible alcohol oxidase (AOX1) 
promoter. 

To express these HIV-1 genes a modified version of the integrative vector PHIL-D2 
(INVITROGEN) was used. This vector was modified in such a way that expression of 

1 0 heterologous protein starts immediately after the native ATG codon of the AOX1 

gene and will produce recombinant protein with a tail of one glycine and six histidines 
residues . This PHIL-D2-MOD vector was constructed by cloning an oligonucleotide 
linker between the adjacent AsuII and EcoRI sites of PHIL-D2 vector (see Figure 3). 
In addition to the His tail, this linker carries Ncol, Spel and Xbal restriction sites 

1 5 between which nef tat and nef tat fusion were inserted. 

2. 1 CONSTRUCTION OF THE INTEGRATIVE VECTORS pRIT14597 
(encoding Nef-His protein), pRIT14598 (encoding Tat-His protein) and 
pRIT14599 (encoding fusion Nef-Tat-His). 

20 

The nef gene was amplified by PCR from the pcDNA3/Nef plasmid with primers 01 
and 02(see section 1.1.1 construction of pRIT14595).The PCR fragment obtained and 
the integrative PHIL-D2-MOD vector were both restricted by Ncol and Spel, purified 
on agarose gel and ligated to create the integrative plasmid pRIT14597 (see Figure 3). 

25 

The tat gene was amplified by PCR from a derivative of the pCVl plasmid 
with primers 05 and 04(see section 1.3.1 construction of pRIT 14596): 

Ncol 

30 PRIMER 05 (Seq ID NO 5): 5 ' ATCGTCCATGGAGCC AGTAGATC 3' 
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An Ncol restriction site was introduced at the 5' end of the PCR fragment while a 
Spel site was introduced at the 3' end with primer 04. The PCR fragment obtained and 
the PHIL-D2-MOD vector were both restricted by Ncol and Spel, purified on agarose 
gel and ligated to create the integrative plasmid pRIT14598. 

5 

To construct pRIT14599, a 910bp DNA fragment corresponding to the nef-tat-His 
coding sequence was ligated between the EcoRI blunted(T4 polymerase) 
and Ncol sites of the PHIL-D2-MOD vector. The nef-tat-His coding fragment was 
obtained by Xbal blunted(T4 polymerase) and Ncol digestions of pRIT14596. 

10 

22 TRANSFORMATION OF PICHIA PASTORIS STRAIN GS115(his4), 



To obtain Pichia pastoris strains expressing Nef-His, Tat-His and the fusion Nef-Tat- 
His, strain GS1 15 was transformed with linear NotI fragments carrying the respective 
1 5 expression cassettes plus the HIS4 gene to complement his4 in the host 

genome.Transformation of GS 115 with Notl-linear fragments favors recombination at 
the AOXI locus. 



Multicopy integrant clones were selected by quantitative dot blot analysis and the type 
20 of integration, insertion (Mut + phenotype) or transplacement (Mufphenotype), 
was determined. 



From each transformation, one transformant showing a high production level for the 
recombinant protein was selected : 

25 

Strain Y1738 (Mut + phenotype) producing the recombinant Nef-His protein, 
a myristylated 215 amino acids protein which is composed of: 

°Myristic acid 

30 °A methionine, created by the use of Ncol cloning site of PHIL-D2-MOD 

vector 

°205 a.a. of Nef protein(starting at a.a.2 and extending to a.a.206) 
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°A threonine and a serine created by the cloning procedure (cloning at'Spel 

site of PHIL-D2-MOD vector. 
°One glycine and six histidines. 

5 Strain Y1739 (Muf phenotype) producing the Tat-His protein, a 95 amino acid 
protein which is composed of: 

°A methionine created by the use of Ncol cloning site 

°85 a.a. of the Tat protein(starting at a.a.2 and extending to a.a.86) 

10 

°A threonine and a serine introduced by cloning procedure 
°One glycine and six histidines 

Strain Y1737(Mut* phenotype) producing the recombinant Nef-Tat-His fusion protein, 
15 a myristylated 302 amino acids protein which is composed of: 

°Myristic acid 

°A methionine, created by the use of Ncol cloning site 
°205a.a. of Nef protein(starting at a.a.2 and extending to a.a.206) 
20 °A threonine and a serine created by the cloning procedure 

°85a.a. of the Tat protein(starting at a.a.2 and extending to a.a.86) 
°A threonine and a serine introduced by the cloning procedure 
°One glycine and six histidines 
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3. EXPRESSION OF HIV-1 Tat-MUTANT IN PICHIA PASTpRIS 

As well as a Nef-Tat mutant fusion protein, a mutant recombinant Tat protein has also 
5 been expressed. The mutant Tat protein must be biologically inactive while 
maintaining its immunogenic epitopes. 

A double mutant tat gene, constructed by D.Clements (Tulane University) was 
selected for these constructs. 

10 

This tat gene (originates from BH10 molecular clone) bears mutations in the active 
site region (Lys41-*Ala)and in RGD motif (Arg78-»Lys and Asp80->Glu) ( 
Virology 235: 48-64, 1997). 

1 5 The mutant tat gene was received as a cDNA fragment subcloned between the EcoRI 
and Hindlll sites within a CMV expression plasmid (pCMVLys41/KGE) 

3.1 CONSTRUCTION OF THE INTEGRATIVE VECTORS 
20 pRIT14912(encoding Tat mutant-His protein) and pRIT14913(encoding fusion 
Nef-Tat mutant-His). 

The tat mutant gene was amplified by PCR from the pCMVLys41/KGE plasmid with 
primers 05 and04 (see section 2.1 construction of pRIT 14598) 

25 

An Ncol restriction site was introduced at the 5' end of the PCR fragment while a 
Spel site was introduced at the 3' end with primer 04. The PCR fragment obtained and 
the PHIL-D2-MOD vector were both restricted by Ncol and Spel, purified on agarose 
gel and ligated to create the integrative plasmid pRIT14912 

30 
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To construct pRIT14913, the tat mutant gene was amplified by PCR from the 
pCMVLys41/KGE plasmid with primers 03 and 04 (see section 1.3.1 construction of 
pRIT 14596). 

5 The PCR fragment obtained and the plasmid pRITl 4597 (expressing Nef-His protein) 
were both digested by Spel restriction enzyme, purified on agarose gel and ligated to 
create the integrative plasmid pPJT14913 

3.2 TRANSFORMATION OF PICHIA PASTORIS STRAIN GS1 15. 

10 

Pichia pastoris strains expressing Tat mutant-His protein and the fusion Nef-Tat 
mutant-His were obtained, by applying integration and recombinant strain selection 
strategies previously described in section 2.2 . 

1 5 Two recombinant strains producing Tat mutant-His protein ,a 95 amino-acids protein, 
were selected: Yl 775 (Mut + phenotype) and Yl 716QA\tf phenotype). 

One recombinant strain expressing Nef-Tat mutant-His fusion protein, a 302 amino- 
acids protein was selected: Y 1 774(Mut + phenotype). 

20 
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4, PURIFICATION OF Nef-Tat-His FUSION PROTEIN (PICHIA 
PASTORIS) 

The purification scheme has been developed from 146g of recombinant Pichia 
pastoris cells (wet weight) or 2L Dyno-mill homogenate OD 55. The 
chromatographic steps are performed at room temperature. Between steps , Nef-Tat 
positive fractions are kept overnight in the cold room (+4°C) ; for longer time, 
samples are frozen at -20°C 



146g of Pichia pastoris cells 
Homogenization 



Buffer : 2L 50 mM PQ 4 pH7.0 
final OD:50 



Dyno-mill disruption (4 passes) 
Centrifiigation 



JA10 rotor / 9500 rpm/ 30 min / room 
temperature 



Dyno-mill Pellet 

Wash 
(lh - 4°C) 

Centrifiigation 
Pellet 



Buffer : +2L 10 mM P0 4 pH 7.5 - 
150mM-NaCl 0,5% empigen 



JA10 rotor / 9500 rpm/ 30 min / room 
temperature 
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Solubilisation 
(O/N - 4°C) 

Reduction 
(4H - room temperature - in the dark) 



Buffer : + 660ml 10 mM P0 4 pH 7.5 - 
150mM NaCl - 4.0M GuHCl 



+ 0,2M 2-mercaptoethanesulfonic acid, 
sodium salt (powder addition) / pH 
adjusted to 7.5 (with 0,5M NaOH 
solution) before incubation 



Carboxymethylation 
(1/2 h - room temperature - in the dark) 



+ 0,25M Iodoacetamid (powder addition) 
/ pH adjusted to 7.5 (with 0,5M NaOH 
solution) before incubation 



Immobilized metal ion affinity chromatography on 
Nf~-NTA- Agarose (Qiagen - 30 ml of resin) 



4> 

Dilution 



Equilibration buffer : 10 mM PCX, pH 7.5 
150mM NaCl - 4.0M GuHCl 



Washing buffer : 
buffer 



7.5 - 150mM 

7.5 - 150mM 
mM 



1) Equilibration 

2) 10mMPO 4 pH 
NaCl -6M Urea 

3) 10mMPO 4 pH 
NaCl -6M Urea- 25 
Imidazol 



Elution buffer 10 mM P0 4 pH 7.5 - 
150mM NaCl - 6M Urea - 0,5M Imidazol 



Down to an ionic strength of 18 mS/cm 2 

Dilution buffer : 10 mM P0 4 pH 7.5 - 6M 
Urea 



Cation exchange chromatography on SP Sepharose FF Equilibration buffer : 10 mM P0 4 pH 7.5 

^ u . , A . - . x - 150mM NaCl- 6.0M Urea 

(Pharmacia - 30 ml of resin) 



■21 
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Concentration 



Washing buffer : 1) Equilibration 
buffer 

2) 10 mM P0 4 pH 
7.5 - 250mM NaCl - 6M Urea 

Elution buffer : 10 mM Borate pH 9.0 - 
2M NaCl-6M Urea 



up to 5 mg/ml 

lOkDa Omega membrane(Filtron) 



Gel filtration chromatography on Superdex200 XK Elution buffer : 10 mM P0 4 pH 7.5 - 
1 6/60 1 50mM NaCl - 6M Urea 



(Pharmacia - 120 ml of resin) 

Dialysis 
(O/N - 4°C) 

Sterile filtration 



5 ml of sample / injection 5 injections 



Buffer : 10 mM P0 4 pH 6.8 - 150mM 
NaCl - 0,5M Arginin* 



Millex GV 0,22nm 



* ratio: 0,5M Arginin for a protein concentration of 160G|Ltg/ml. 
5 Purity 

The level of purity as estimated by SDS-PAGE is shown in Figure 4 by Daiichi 
Silver Staining and in Figure 5 by Coomassie blue G250. 



10 
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After Superdex200 step: > 95% 

After dialysis and sterile filtration steps: > 95% 

5 Recovery 

51mg of Nef-Tat-his protein are purified from 146g of recombinant Pichia pastoris 
cells (= 2L of Dyno-mill homogenate OD 55) 

10 5, VACCINE PREPARATION 

A vaccine prepared in accordance with the invention comprises the expression product 
of a DNA recombinant encoding an antigen as exemplified in example 1 or 2 and as 
adjuvant, the formulation comprising a mixture of 3 de -O-acylated monophosphoryl 
15 lipid A 3D-MPL and QS21 in an oil/water emulsion. 

3D-MPL: is a chemically detoxified form of the lipopolysaccharide (LPS) of the 
Gram-negative bacteria Salmonella minnesota. 

20 Experiments performed at Smith Kline Beecham Biologicals have shown that 

3D-MPL combined with various vehicles strongly enhances both the humoral and a 
TH1 type of cellular immunity. 

QS21: is one saponin purified from a crude extract of the bark of the Quillaja 
25 Saponaria Molina tree, which has a strong adjuvant activity: it activates both antigen- 
specific lymphoproliferation and CTLs to several antigens. 
Experiments performed at Smith Kline Beecham Biologicals have demonstrated a 
clear synergistic effect of combinations of 3D-MPL and QS21 in the induction of both 
humoral and TH1 type cellular immune responses. 

30 

The oil/water emulsion is composed of 2 oils (a tocopherol and squaiene), and of 
PBS containing Tween 80 as emulsifier. The emulsion comprised 5% squaiene 5% 
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tocopherol 0.4% Tween 80 and had an average particle size of 1 80 nm (see WO 
95/17210). 

Experiments performed at Smith Kline Beecham Biologicals have proven that the 
5 adjunction of this O/W emulsion to 3D-MPL/QS21 further increases their 
immunostimulant properties. 

Preparation of the oil/water emulsion (2 fold concentrate) 

10 Tween 80 is dissolved in phosphate buffered saline (PBS) to give a 2% solution in the 
PBS. To provide 100ml two fold concentrate emulsion 5g of DL alpha tocopherol 
and 5ml of squalene are vortexed to mix thoroughly. 90ml of PBS/Tween solution is 
added and mixed thoroughly. The resulting emulsion is then passed through a syringe 
and finally microfluidised by using an Ml 1 OS microfluidics machine. The resulting 

1 5 oil droplets have a size of approximately 180 nm. 

Preparation of oil in water formulation. 

Antigen prepared in accordance with example 1 or 2 (5|ig) was diluted in 10 fold 
20 concentrated PBS pH 6.8 and H 2 0 before consecutive addition of SB62, 3D-MPL 
(5^g)> QS21 (5 jig) and 50 |ig/ml thiomersal as preservative at 5 min interval. The 
emulsion volume is equal to 50% of the total volume (50|il for a dose of 100^x1). 

All incubations were carried out at room temperature with agitation. 

25 

6. IMMUNOGENICITY OF Tat AND Nef-Tat IN RODENTS 

Characterization of the immune response induced after immunization with Tat and 
30 NefTat was carried out. To obtain information on isotype profiles and cell-mediated 
immunity (CMI) two immunization experiments in mice were conducted. In the first 
experiment mice were immunized twice two weeks apart into the footpad with Tat or 
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NefTat in the oxydized or reduced form, respectively. Antigens were formulated in an 
oil in water emulsion comprising squalene, tween 80 Tm (polyoxyethylene sorbitan 
monooleate) QS21 , 3D-MPL and a-tocopherol, and a control group received the 
adjuvant alone. Two weeks after the last immunization sera were obtained and 

5 subjected to Tat-specific ELISA (using reduced Tat for coating) for the determination 
of antibody titers and isotypes (Figure 6a). The antibody titers were highest in the 
mice having received oxydized Tat. In general, the oxydized molecules induced 
higher antibody titers than the reduced forms, and Tat alone induced higher antibody 
titers than NefTat. The latter observation was confirmed in the second experiment. 

10 Most interestingly, the isotype profile of Tat-specific antibodies differed depending on 
the antigens used for immunization. Tat alone elicited a balanced IgGland IgG2a 
profile, while NefTat induced a much stronger T m bias (Figure 6b). This was again 
confirmed in the second experiment. 



15 In the second mouse experiment animals received only the reduced forms of the 
molecules or the adjuvant alone. Besides serological analysis (see above) 
lymphoproliferative responses from lymph node cells were evaluated. After 
restimulation of those cells in vitro with Tat or NefTat 3 H-thymidine incorporation 
was measured after 4 days of culture. Presentation of the results as stimulation indices 

20 indicates that very strong responses were induced in both groups of mice having 
received antigen (Figure 7). 

In conclusion, the mice studies indicate that Tat as well as Nef-Tat are highly 
immunogenic candidate vaccine antigens. The immune response directed against the 
25 two molecules is characterized by high antibody responses with at least 50% IgGl . 
Furthermore, strong CMI responses (as measured by lymphoproliferation) were 
observed. 

7. FUNCTIONAL PROPERTIES OF THE Tat AND Nef-Tat PROTEINS 

30 

The Tat and NefTat molecules in oxydized or reduced form were investigated for 

their ability to bind to human T cell lines. Furthermore, the effect on growth of 
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those cell lines was assessed. ELISA plates were coated overnight with different 
concentration of the Tat and NefTat proteins, the irrelevant gD from herpes simplex 
virus type II, or with a buffer control alone. After removal of the coating solution 
HUT-78 cells were added to the wells. After two hours of incubation the wells were 

5 washed and binding of cells to the bottom of the wells was assessed 

microscopically. As a quantitative measure cells were stained with toluidine blue, 
lysed by SDS, and the toluidine blue concentration in the supernatant was 
determined with an ELISA plate reader. The results indicate that all four proteins, 
Tat and NefTat in oxydized or reduced form mediated binding of the cells to the 

10 ELISA plate (Figure 8). The irrelevant protein (data not shown) and the buffer did 
not fix the cells. This indicates that the recombinantly expressed Tat-containing 
proteins bind specifically to human T cell lines. 

In a second experiment HUT-78 cells were left in contact with the proteins for 16 
15 hours. At the end of the incubation period the cells were labeled with [^H]- 

thymidine and the incorporation rate was determined as a measure of cell growth. 
All four proteins included in this assay inhibited cell growth as judged by 
diminished radioactivity incorporation (Figure 9). The buffer control did not 
mediate this effect. These results demonstrate that the recombinant Tat-containing 
20 proteins are capable of inhibiting growth of a human T cell line. 

In summary the functional characterization of the Tat and NefTat proteins reveals 
that these proteins are able to bind to human Tcell lines. Furthermore, the proteins 
are able to inhibit growth of such cell lines. 
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1. A vaccine composition which comprises a protein comprising 

(a) an HIV Tat protein or derivative thereof linked to either (i) a fusion partner 
or (ii) an HIV Nef protein or derivative thereof; or 

(b) an HIV Nef protein or derivative thereof linked to either (i) a fusion partner 
or (ii) an HIV Tat protein or derivative thereof; or 

(c) an HIV Nef protein or derivative thereof linked to an HIV Tat protein or 
derivative thereof and a fusion partner, 

in admixture with a pharmaceutical^ acceptable excipient. 



A composition as claimed in claim 1 comprising a Tat-Nef fusion protein or 
derivative thereof. 



A composition as claimed in claim 1 comprising a Nef-Tat fusion protein or 
derivative thereof. 



A composition according to any one of claims 1 to 3 wherein the derivative 
of the Tat protein is a mutated Tat protein. 

A composition according to any one of claims 1 to 4 wherein the derivative 
of the Nef protein is a mutated Nef protein. 

A composition as claimed in anv one of HairrK 1 - * wh^i 
partner is a lipoprotein or derivative thereof. 

A composition as claimed in claim 6 wherein the lipoprotein is Haemophilus 
Influenza B protein D or derivative thereof. 

A composition as claimed in claim 7 wherein the fusion partner comprises 
between 100-130 amino acid from the N terminal of Haemophilus Influenza 
B protein D. 
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9. A composition as claimed in any one of Claims 1 to 8, wherein the Tat 
protein is the entire Tat protein. 

10. A composition as claimed in any one of Claims 1 to 8, wherein the Nef 
protein is the entire Nef protein. 

11. A composition as claimed in any one of Claims 1 to 10, wherein the Tat 
protein is fused to an HIV Nef protein and a fusion partner. 

12. A composition as claimed in any one of claims 1 to 1 1, wherein the protein 
has a Histidine tail. 

13. A composition as claimed in any one of claims 1 to 12 wherein the protein is 
carboxymethylated. 

14. A composition as claimed in any one of claims 1 to 13, additionally 
comprising an adjuvant. 

15. A composition as claimed in claim 14, wherein the adjuvant is a TH1 
inducing adjuvant. 

16. A composition as claimed in claim 14 or 15 which adjuvant comprises 
mononhosnhnrvl linid A or a 

monophosphoryl lipid A. 

17. A composition as claimed in any one of claims 14 to 16 additionally 
comprising a saponin adjuvant. 

18. A composition as claimed in any one of claims 14 to 17 which additionally 
comprises an oil in water emulsion. 
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19. A composition as 



1 • • • 



• • • 



► • • • • • • I 

claimed in My one of clainK«l4».18 furtlj^r* copjprising 



HIV gpl60 or its derivative gpl20. 



20. A protein comprising an HIV Tat protein or derivative thereof linked to an 
HIV Nef protein or derivative thereof in Nef-Tat or Tat-Nef orientation. 



21. A nucleic acid encoding a protein of claim 20. 



22. A host transformed with a nucleic acid of claim 21 . 



23. A host as claimed in claim 22 wherein the host is either E.coli or Pichia 
pastoris. 



24. A method of producing a protein of claim 20, comprising providing a host as 
claimed in claim 22 or 23, expressing said protein and recovering the 
protein. 

25. A method of preparing (i) an HIV Nef protein or derivative thereof or (ii) an 
HIV Tat protein or derivative thereof in Pichia pastoris which method 
comprises the steps of transforming Pichia pastoris with DNA encoding said 
HIV Nef protein or derivative thereof or HIV Tat protein or derivative 
thereof, expressing said protein and recovering the protein. 

26. The method of claim 24 or claim 2^ further pnmnricina n 
carboxymethylation step performed on the expressed protein. 

27. A method of producing a vaccine, comprising admixing the protein from any 
one of claims 24 to 26 with a pharmaceutically acceptable diluent. 

28. The method of claim 27 further comprising the addition of HIV gpl60 or its 
derivative gpl20. 
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• I * * • • • * •••••• 

29. The method of claims 24 to 28" further comprising*** additiQil o£afl 
adjuvant, particularly a TH1 inducing adjuvant. 

30. A vaccine composition comprising a recombinant Tat-containing protein 
formulated with a mixture of 3D-MPL, QS21 and an oil in water emulsion 

31 . A composition as claimed in claim 30 wherein the oil in water emulsion 
comprises squalene, polyoxyethylene sorbitan monooleate and a-tocopherol. 
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SEQUENCE LISTING 

(1) GENERAL INFORMATION 
(i) APPLICANT: SmithKline Beecham Biologicals S.A. 
(ii) TITLE OF THE INVENTION: Vaccine 



(iii) NUMBER OF SEQUENCES: 27 

(iv) CORRESPONDENCE ADDRESS: 

(A) ADDRESSEE: SmithKline Beecham 

(B) STREET: Two New Horizons Court 

(C) CITY: Brentford 

(D) STATE: 

(E) COUNTRY: Middx, UK 

(F) zip: rm 9EP 

(v) COMPUTER READABLE FORM: 

(A) MEDIUM TYPE; Diskette 

(B) COMPUTER: IBM Compatible 

(C) OPERATING SYSTEM: DOS 

(D) SOFTWARE : FastSEQ for Windows Version 2.0 

{vi) CURRENT APPLICATION DATA: 

(A) APPLICATION NUMBER: 

(B) FILING DATE; 26-SEP-1997 

(C) CLASSIFICATION: 

(vii) PRIOR APPLICATION DATA: 

(A) APPLICATION NUMBER: 

(B) FILING DATE: 



(viii) ATTORNEY / AGENT INFORMATION: 
(A) NAME: Bor, Fiona R 

{B} REGISTRATION NUMBER: 

<C) REFERENCE / DOCKET NUMBER: 

(ix) TELECOMMUNICATION INFORMATION: 

(A) TELEPHONE: 0181 975 2817 

(B) TELEFAX: 0181 975 €141 

(C) TELEX: 



(2) INFORMATION FOR SEQ ID NO:l: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 28 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 
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(xi) SEQUENCE DESCRIPTION: SEQ ID NO:l: 
ATCGTCCATG . GGT . GGC + A AG * TGG . T 
(2) INFORMATION FOR SEQ ID NO: 2: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 23 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 2: 
CGGCTACTAG TGCAGTTCTT GAA 
(2) INFORMATION FOR SEQ ID NO: 3: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 29 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 3: 
ATCGTACTAG T. GAG. CCA. GT A . GAT . C 
(2) INFORMATION FOR SEQ ID NO:4: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 24 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 4; 
CGGCTACTAG TTTCCTTCGG GCCT 
(2) INFORMATION FOR SEQ ID NO: 5: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 23 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 
CD) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 5: 
ATCGTCCATG GAGCCAGTAG ATC 
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(2) INFORMATION FOR SEQ ID NO: 6: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 4 41 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 6: 

ATGGATCCAA AAACTTTAGC CCTTTCTTTA TTAGCAGCTG GCGTACTAGC AGGTTGTAGC 60 

AGCCATTCAT CAAATATGGC GAATACCCAA ATGAAATCAG ACAAAATCAT TATTGCTCAC 120 

CGTGGTGCTA GCGGTTATTT AC CAGAGCAT ACGTTAGAAT CTAAAGCACT TGCTTTTGCA 180 

CAACAGGCTG ATTATTTAGA GCAAGATTTA GCAATGACTA AGGATGGTCG TTTAGTGGTT 240 

ATTCACGATC ACTTTTTAGA TGGCTTGACT GATGTTGCGA AAAAATTCCC ACATCGTCAT 300 

CGTAAAGATG GCCGTTACTA TGTCATCGAC TTTACCTTAA AAGAAATTCA AAGTTTAGAA 360 

AT G AC AG AAA ACTTTGAAAC CATGGCCACG TGTGATCAGA GCTCAACTAG TGGCCACCAT 420 

CACCATCACC ATTAATCTAG A 441 

(2) INFORMATION FOR SEQ ID NO: 7: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 14 4 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 7: 





Met 


Asp 


Pro 


Lys 


Thr 


Leu 


Ala 


Leu 


Ser 


Leu Leu Ala Ala Gly Val 


Leu 




1 






5 










10 








15 






Ala 


Gly 


Cys 


Ser 
20 


Ser 


His 


Ser 


Ser 


Asn 
25 


Met Ala 


Asn 


Thr 


Gin 
30 


Met 


Lys 




Ser 


Asp 


Lys 


He 


He 


He 


Ala 


His 


Arg 


Gly Ala 


Ser 


Gly 


Tyr 


Leu 


Pro 








35 










40 








45 










Glu 


His 
50 


Thr 


Leu 


Glu 


Ser 


Lys 
55 


Ala 


Leu 


Ala Phe 


Ala 
60 


Gin 


Gin 


Ala 


Asp 




Tyr 


Leu 


Glu 


Gin 


Asp 


Leu 


Ala 


Met 


Thr 


Lys Asp 


Gly Arg 


Leu 


Val 


Val 




65 










70 








75 










80 




He 


His 


Asp 


His 


Phe 
85 


Leu 


Asp 


Gly 


Leu 


Thr Asp 
90 


Val 


Ala 


Lys 


Lys 
95 


Phe 




Pro 


His 


Arg 


His 
100 


Arg 


Lys 


Asp 


Gly 


Arg 
105 


Tyr Tyr 


Val 


He 


Asp 
110 


Phe 


Thr 




Leu 


Lys 


Glu 
115 


He 


Gin 


Ser 


Leu 


Glu 
120 


Met 


Thr Glu 


Asn 


Phe 
125 


Glu 


Thr 


Met 




Ala 


Thr 
130 


Cys 


Asp 


Gin 


Ser 


Ser 
135 


Thr 


Ser 


Gly His 


His 
140 


His 


His 


His 


His 



(2) INFORMATION FOR SEQ ID NO: 8: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 648 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 
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(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 8: 

ATGGGTGGCA AGTGGTCAAA AAGTAGTGTG GTTGGATGGC CTACTGTAAG GGAAAGAATG 60 

AGACGAGCTG AGCCAGCAGC AGATGGGGTG GGAGCAGCAT CTCGAGACCT GGAAAAACAT 120 

GGAGCAATCA CAAGTAGCAA TACAGCAGCT ACCAATGCTG CTTGTGCCTG GCTAGAAGCA 180 

CAAGAGGAGG AGGAGGTGGG TTTTCCAGTC ACACCTCAGG TACCTTTAAG ACCAATGACT 240 

TACAAGGCAG CTGTAGATCT TAGCCACTTT TTAAAAGAAA AGGGGGGACT GGAAGGGCTA 300 

ATTCACTCCC AACGAAGACA AGATATCCTT GATCTGTGGA TCTACCACAC ACAAGGCTAC 360 

TTCCCTGATT GGCAGAACTA CACACCAGGG CCAGGGGTCA GATATCCACT GACCTTTGGA 420 

TGGTGCTACA AGCTAGTACC AGTTGAGCCA GATAAGGTAG AAGAGGCCAA TAAAGGAGAG 4 80 

AACACCAGCT TGTTACACCC TGTGAGCCTG CATGGAATGG ATGACCCTGA GAGAGAAGTG 540 

TTAGAGTGGA GGTTTGACAG CCGCCTAGCA TTTCATCACG TGGCCCGAGA GCTGCATCCG 600 

GAGTACTTCA AGAACTGCAC TAGTGGCCAC CATCACCATC ACCATTAA 648 

(2) INFORMATION FOR SEQ ID NO; 9: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 216 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 9: 



Met 


Giy 


Giy 


Lys 


Trp 


Ser 


Lys 


Ser 


Ser 


Val 


Val 


Giy 


Trp 


Pro 


Thr 


Val 


1 








5 










10 










15 




Arg 


Glu 


Arg 


Met 


Arg Arg Ala 


Glu 


Pro 


Ala 


Ala 


Asp 


Giy Val 


Giy Ala 








20 










25 










30 






Ala 


Ser 


Arg 


Asp 


Leu 


Glu 


Lys 


His 


Giy Ala 


He 


Thr 


Ser 


Ser 


Asn 


Thr 






35 










40 










45 








Ala 


Ala 
50 


Thr 


Asn 


Ala 


Ala 


Cys 
55 


Ala 


Trp 


Leu 


Glu 


Ala 
60 


Gin 


Glu 


Glu 


Glu 


Glu 


Vai 


Giy 


Phe 


Pro 


Val 


Thr 


Pro 


Gin 


Val 


Pro 


Leu 


Arg 


Pro 


Met 


Thr 


65 










70 










75 










80 


Tyr 


Lys 


Ala 


Ala 


Val 
85 


Asp 


Leu 


Ser 


His 


Phe 
90 


Leu 


Lys 


Glu 


Lys 


Giy 
95 


Giy 


Leu 


Glu 


Giy 


Leu 


lie 


His 


Ser 


Gin 


Arg Arg 


Gin 


Asp 


He 


Leu 


Asp 


Leu 








100 










105 










110 






Trp 


lie 


Tyr 
115 


His 


Thr 


Gin 


Giy 


Tyr 
120 


Phe 


Pro 


Asp 


Trp 


Gin 
125 


Asn 


Tyr 


Thr 


Pro 


Giy 
130 


Pro 


Giy 


Val 


Arg 


Tyr 
135 


Pro 


Leu 


Thr 


Phe 


Giy 
140 


Trp 


Cys 


Tyr 


Lys 


Leu 


Val 


Pro 


Val 


Glu 


Pro 


Asp 


Lys 


Val 


Glu 


Glu 


Ala 


Asn 


Lys 


Giy 


Glu 


145 










150 










155 










160 


Asn 


Thr 


Ser 


Leu 


Leu 
165 


His 


Pro 


Val 


Ser 


Leu 
170 


His 


Giy 


Met 


Asp 


Asp 
175 


Pro 


Glu Arg 


Glu 


Val 


Leu 


Glu 


Trp 


Arg 


Phe 


Asp 


Ser 


Arg 


Leu 


Ala 


Phe 


His 








180 










185 










190 






His 


Val 


Ala 
195 


Arg 


Glu 


Leu 


His 


Pro 
200 


Glu 


Tyr 


Phe 


Lys 


Asn 
205 


Cys 


Thr 


Ser 


Giy 


His 
210 


His 


His 


His 


His 


His 
215 





















(2) INFORMATION FOR SEQ ID NO: 10: 
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<i) SEQUENCE CHARACTERISTICS: 
(A) LENGTH: 288 base pairs 
{B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO; 10: 

ATGGAGCCAG TAGATCCTAG ACTAGAGCCC TGGAAGCATC CAGGAAGTCA GCCTAAAACT 
GCTTGTACCA ATTGCTATTG TAAAAAGTGT TGCTTTCATT GCCAAGTTTG TTTCATAACA 
AAAGCCTTAG GCATCTCCTA TGGCAGGAAG AAGCGGAGAC AGCGACGAAG ACCTCCTCAA 
GGCAGTCAGA CTCATCAAGT TTCTCTATCA AAGCAACCCA CCTCCCAATC CCGAGGGGAC 
CCGACAGGCC CGAAGGAAAC TAGTGGCCAC CATCACCATC ACCATTAA 



(2) INFORMATION FOR SEQ ID NO: 11: 



(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 96 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 11: 

Met Glu Pro Val Asp Pro Arg Leu Glu Pro Trp Lys His Pro Gly Ser 

15 10 15 

Gin Pro Lys Thr Ala Cys Thr Asn Cys Tyr Cys Lys Lys Cys Cys Phe 

20 25 30 

His Cys Gin Val Cys Phe He Thr Lys Ala Leu Gly He Ser Tyr Gly 

35 40 45 

Arg Lys Lys Arg Arg Gin Arg Arg Arg Pro Pro Gin Gly Ser Gin Thr 

50 55 60 

His Gin Val Ser Leu Ser Lys Gin Pro Thr Ser Gin Ser Arg Gly Asp 
65 70 75 80 

Pro Thr Gly Pro Lys Glu Thr Ser Gly His His His His His His 
85 90 55 



(2) INFORMATION FOR SEQ ID NO: 12: 



(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 909 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 12: 

ATGGGTGGCA AGTGGTCAAA AAGTAGTGTG GTTGGATGGC CTACTGTAAG GGAAAGAATG 60 

AGACGAGCTG AGCCAGCAGC AGATGGGGTG GGAGCAGCAT CTCGAGACCT GGAAAAACAT 120 

GGAGCAAT C A CAAGTAGCAA TACAGCAGCT ACCAATGCTG CTTGTGCCTG GCTAGAAGCA 180 

CAAGAGGAGG AGGAGGTGGG TTTTCCAGTC ACACCTCAGG TACCTTTAAG ACCAATGACT 240 

TACAAGGCAG CTGTAGATCT TAGCCACTTT TTAAAAGAAA AGGGGGGACT GGAAGGGCTA 300 

ATTCACTCCC AACGAAGACA AGATATCCTT GATCTGTGGA TCTACCACAC ACAAGGCTAC 360 
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TTCCCTGATT GGCAGAACTA CACACCAGGG 
TGGTGCTACA AGCTAGTACC AGTTGAGCCA 
AACACCAGCT TGTTACACCC TGTGAGCCTG 
TTAGAGTGGA GGTTTGACAG CCGCCTAGCA 
GAGTACTTCA AGAACTGCAC TAGTGAGCCA 
CCAGGAAGTC AGCCTAAAAC TGCTTGTACC 
TGCCAAGTTT GTTTCATAAC AAAAGCCTTA 
CAGCGACGAA GACCTCCTCA AGGCAGTCAG 
ACCTCCCAAT CCCGAGGGGA CCCGACAGGC 
CACCATTAA 

(2) INFORMATION FOR SEQ ID NO 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 303 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY; linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 13: 





Met 


Giy 


Giy 


Lys 


Trp 


Ser 


Lys 


Ser 


Ser 


Val 


Val 


Giy Trp 


Pro 


Thr Val 




1 








5^ 










10 










15 




Arg 


Glu 


Arg 


Met 


Arg 


Arg 


Ala 


Glu 


Pro 


Ala 


Ala 


Asp 


Giy Val Giy Ala 










20 










25 










30 






Ala 


Ser 


Arg 


Asp 


Leu 


Glu 


Lys 


His 


Giy 


Ala 


He 


Thr 


Ser 


Ser 


Asn Thr 








35 










40 










45 








Ala 


Ala 


Thr 


Asn 


Ala 


Ala 


Cys 


Ala 


Trp 


Leu 


Glu 


Ala 


Gin 


Glu 


Glu Glu 






50 










55 










60' 










Glu 


Val 


Giy 


Phe 


Pro 


Val 


Thr 


Pro 


Gin 


Val 


Pro 


Leu Arg 


Pro 


Met Thr 




65 










70 










75 








80 




Tyr 


Lys 


Ala 


Ala 


Val 


Asp 


Leu 


Ser 


His 


Phe 


Leu 


Lys 


Glu 


Lys 


Giy Giy 


j i 1 










85 










90 










95 


Q 


Leu 


Glu 


Giy 


Leu 


He 


His 


Ser 


Gin 


Arg 


Arg 


Gin 


Asp 


He 


Leu Asp Leu 










100 










105 










110 






Trp 


lie 


Tyr 


His 


Thr 


Gin 


Giy Tyr 


Phe 


Pro 


Asp 


Trp 


Gin 


Asn 


Tyr Thr 








115 










120 










125 








Pro 


Giy 
130 


Pro 


Giy 


Val 


Arg 


Tyr 
135 


Pro 


Leu 


Thr 


Phe 


Giy 
140 


Trp 


Cys 


Tyr Lys 




Leu 


Val 


Pro 


Val 


Glu 


Pro 


Asp 


Lys 


Val 


Glu 


Glu 


Ala 


Asn 


Lys 


Giy Glu 




145 










150 










155 








160 




Asn 


Thr 


Ser 


Leu 


Leu 


His 


Pro 


Val 


Ser 


Leu 


His 


Giy Met 


Asp 


Asp Pro 












165 










170 










175 




Glu 


Arg 


Glu 


Val 


Leu 


Glu 


Trp Arg 


Phe 


Asp 


Ser 


Arg 


Leu 


Ala 


Phe His 










180 










185 










190 






His 


Val 


Ala 
195 


Arg 


Glu 


Leu 


His 


Pro 
200 


Glu 


Tyr 


Phe 


Lys 


Asn 
205 


Cys 


Thr Ser 




Glu 


Pro 


Val 


Asp 


Pro 


Arg 


Leu 


Glu 


Pro 


Trp 


Lys 


His 


Pro 


Giy Ser Gin 






210 










215 










220 










Pro 


Lys 


Thr 


Ala 


Cys 


Thr 


Asn 


Cys 


Tyr 


Cys 


Lys 


Lys 


Cys 


Cys 


Phe His 




225 










230 










235 








240 




Cys 


Gin 


Val 


Cys 


Phe 


He 


Thr 


Lys 


Ala 


Leu Giy 


He 


Ser 


Tyr 


Giy Arg 












245 










250 










255 




Lys 


Lys 


Arg 


Arg 


Gin 


Arg 


Arg Arg 


Pro 


Pro 


Gin Giy 


Ser 


Gin 


Thr His 










260 










265 










270 






Gin 


Val 


Ser 


Leu 


Ser 


Lys 


Gin 


Pro 


Thr 


Ser 


Gin 


Ser 


Arg 


Giy Asp Pro 



CCAGGGGTCA GATATCCACT GACCTTTGGA 420 

GATAAGGTAG AAGAGGCCAA TAAAGGAGAG 480 

CATGGAATGG ATGACCCTGA GAGAGAAGTG 540 

TTTCATCACG TGGCCCGAGA GCTGCATCCG 600 

GTAGATCCTA GACTAGAGCC CTGGAAGCAT 660 

AATTGCTATT GTAAAAAGTG TTGCTTTCAT 720 

GGCATCTCCT ATGGCAGGAA GAAGCGGAGA 780 

ACTCATCAAG TTTCTCTATC AAAGCAACCC 840 

CCGAAGGAAA CTAGTGGCCA CCATCACCAT 900 

909 

:13: 
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275 280 285 

Thr Gly Pro Lys Glu Thr Ser Gly His His His His His His 
290 295 300 

(2) INFORMATION FOR SEQ ID NO:14: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH; 1029 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 14: 

ATGGATCCAA AAACTTTAGC CCTTTCTTTA TTAGCAGCTG GCGTACTAGC AGGTTGTAGC 60 

AGCCATTCAT CAAATATGGC GAATACCCAA AT GAAATCAG ACAAAATCAT TATTGCTCAC 120 

CGTGGTGCTA GCGGTTATTT ACCAGAGCAT ACGTTAGAAT CTAAAGCACT TGCTTTTGCA 180 

CAACAGGCTG ATTATTTAGA GCAAGATTTA GCAATGACTA AGGATGGTCG TTTAGTGGTT 240 

ATTCACGATC ACTTTTTAGA TGGCTTGACT GATGTTGCGA AAAAATTCCC ACATCGTCAT 300 

CGTAAAGATG GCCGTTACTA TGTCATCGAC TTTACCTTAA AAGAAATTCA AAGTTTAGAA 360 

AT G AC AG AAA ACTTTGAAAC CATGGGTGGC AAGTGGTCAA AAAGTAGTGT GGTTGGATGG 420 

CCTACTGTAA GGGAAAGAAT GAGACGAGCT GAGCCAGCAG CAGATGGGGT GGGAGCAGCA 4 80 

TCTCGAGACC TGGAAAAACA TGGAGCAATC ACAAGTAGCA ATACAGCAGC TACCAATGCT 540 

GCTTGTGCCT GGCTAGAAGC ACAAGAGGAG GAGGAGGTGG GTTTTCCAGT CACACCTCAG 600 

GTACCTTTAA GACCAATGAC TTACAAGGCA GCTGTAGATC TTAGCCACTT TTTAAAAGAA 660 

AAGGGGGGAC TGGAAGGGCT AATTCACTCC CAACGAAGAC AAGATATCCT TGATCTGTGG 720 

ATCTACCACA CACAAGGCTA CTTCCCTGAT TGGCAGAACT ACACACCAGG GCCAGGGGTC 780 

AGATATCCAC TGACCTTTGG ATGGTGCTAC AAGCTAGTAC CAGTTGAGCC AGATAAGGTA 840 

GAAGAGGCCA ATAAAGGAGA GAACACCAGC TTGTTACACC CTGTGAGCCT GCATGGAATG 900 

GATGACCCTG AGAGAGAAGT GTTAGAGTGG AGGTTTGACA GCCGCCTAGC ATTTCATCAC 960 

GTGGCCCGAG AGCTGCATCC GGAGTACTTC AAGAACTGCA CTAGTGGCCA CCATCACCAT 1020 

CACCATTAA 1029 

(2) INFORMATION FOR SEQ ID NO: 15: 

(i) SEQUENCE CHARACTERISTICS; 

(A) LENGTH: 325 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 15: 



Cys 


Ser 


Ser 


His 


Ser 


Ser 


Asn 


Met 


Ala 


Asn Thr 


Gin 


Met 


Lys 


Ser 


Asp 


1 








5 










10 








15 




Lys 


He 


lie 


He 


Ala 


His 


Arg 


Gly Ala 


Ser Gly 


Tyr 


Leu 


Pro 


Glu 


His 








20 










25 








30 






Thr 


Leu 


Glu 


Ser 


Lys 


Ala 


Leu 


Ala 


Phe Ala Gin Gin Ala Asp Tyr Leu 






35 










40 








45 








Glu 


Gin 


Asp 


Leu 


Ala 


Met 


Thr 


Lys 


Asp 


Gly Arg 


Leu 


Val 


Val 


He 


His 




50 










55 








60 










Asp 


His 


Phe 


Leu 


Asp 


Gly 


Leu 


Thr 


Asp 


Val Ala 


Lys 


Lys 


Phe 


Pro 


His 


65 










70 








75 










80 


Arg 


His 


Arg 


Lys 


Asp Gly Arg 


Tyr 


Tyr 


Val He 


Asp 


Phe 


Thr 


Leu 


Lys 



85 90 95 
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Glu 


He 


Gin 


Ser 


Leu 


Glu 


Met 


Thr 


Glu 


Asn 


Phe 


Glu 


Thr 


Met 


Gly Gly 








100 










105 










110 






Lys 


Trp 


Ser 


Lys 


Ser 


Ser 


Val 


Val 


Gly Trp 


Pro 


Thr 


Val 


Arg 


Glu 


Arg 






115 










120 










125 








Met 


Arg 


Arg 


Ala 


Glu 


Pro 


Ala 


Ala 


Asp 


Gly Val 


Gly 


Ala 


Ala 


Ser 


Arg 




130 










135 










140 










Asp 


Leu 


Glu 


Lys 


His 


Gly Ala 


He 


Thr 


Ser 


Ser 


Asn 


Thr 


Ala 


Ala 


Thr 


145 










150 










155 










160 


Asn 


Ala 


Ala 


Cys 


Ala 


Trp 


Leu 


Glu 


Ala 


Gin 


Glu 


Glu 


Glu 


Glu 


Val 


Gly 










165 










170 










175 




Phe 


Pro 


Val 


thr 


Pro 


Gin 


Val 


Pro 


Leu 


Arg 


Pro 


Met 


Thr 


Tyr 


Lys 


Ala 








180 










185 










190 






Ala 


Val 


Asp 


Leu 


Ser 


His 


Phe 


Leu 


Lys 


Glu 


Lys 


Gly 


Gly 


Leu 


Glu 


Gly 






195 










200 










205 








Leu 


He 


His 


Ser 


Gin Arg Arg 


Gin 


Asp 


He 


Leu 


Asp 


Leu 


Trp 


He 


Tyr 




210 










215 










220 










His 


Thr 


Gin 


Gly 


Tyr 


Phe 


Pro 


Asp 


Trp 


Gin 


Asn 


Tyr 


Thr 


Pro 


Gly 


Pro 


225 










230 










235 










240 


Gly 


Val 


Arg 


Tyr 


Pro 


Leu 


Thr 


Phe 


Gly 


Trp Cys 


Tyr 


Lys 


Leu 


Val 


Pro 










245 










250 










255 




Val 


Glu 


Pro 


Asp 


Lys 


Val 


Glu 


Glu 


Ala 


Asn 


Lys 


Gly 


Glu 


Asn 


Thr 


Ser 








260 










265 










270 






Leu 


Leu 


His 


Pro 


Val 


Ser 


Leu 


His 


Gly 


Met 


Asp 


Asp 


Pro 


Glu 


Arg 


Glu 






275 










280 










285 








Val 


Leu 


Glu 


Trp 


Arg 


Phe 


Asp 


Ser 


Arg 


Leu 


Ala 


Phe 


His 


His 


Val 


Ala 




290 










295 










300 










Arg 


Glu 


Leu 


His 


Pro 


Glu 


Tyr 


Phe 


Lys 


Asn 


Cys 


Thr 


Ser 


Gly 


His 


His 


305 










310 










315 










320 


His 


His 


His 


His 



























(2) INFORMATION FOR SEQ ID NO: 16: 

fi} SEQUENCE CHARACTERISTICS: 
(A) LENGTH: 1290 base pairs 
<B) TYPE: nucleic acid 
(C5 STRANDEDNESS: single 
(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 16: 

ATGGATCCAA AAACTTTAGC CCTTTCTTTA TTAGCAGCTG GCGTACTAGC AGGTTGTAGC 60 

AGCCATTCAT CAAATATGGC GAATACCCAA ATGAAATCAG ACAAAATCAT TAT TGCTC AC 120 

CGTGGTGCTA GCGGTTATTT ACCAGAGCAT ACGTTAGAAT CTAAAGCACT TGCGTTTGCA 180 

CAACAGGCTG ATTATTTAGA GCAAGATTTA GCAATGACTA AGGATGGTCG TTTAGTGGTT 240 

ATTCACGATC ACTTTTTAGA TGGCTTGACT GATGTTGCGA AAAAATTCCC ACATCGTCAT 300 

CGTAAAGATG GCCGTTACTA TGTCATCGAC TTTACCTTAA AAGAAATTCA AAGTTTAGAA 360 

AT G AC AG AAA ACTTTGAAAC CATGGGTGGC AAGTGGTCAA AAAGTAGTGT GGTTGGATGG 420 

CCTACTGTAA GGGAAAGAAT GAGACGAGCT GAGCCAGCAG CAGATGGGGT GGGAGCAGCA 480 

TCTCGAGACC TGGAAAAACA TGGAGCAATC ACAAGTAGCA ATACAGCAGC TACCAATGCT 540 

GCTTGTGCCT GGCTAGAAGC ACAAGAGGAG GAGGAGGTGG GTTTTCCAGT CACACCTCAG 600 

GTACCTTTAA GACCAATGAC TTACAAGGCA GCTGTAGATC TTAGCCACTT TTTAAAAGAA 660 

AAGGGGGGAC TGGAAGGGCT AATTCACTCC CAACGAAGAC AAGATATCCT TGATCTGTGG 720 

ATCTACCACA CACAAGGCTA CTTCCCTGAT TGGCAGAACT ACACACCAGG GCCAGGGGTC 780 

AGATATCCAC TGACCTTTGG ATGGTGCTAC AAGCTAGTAC CAGTTGAGCC AGATAAGGTA 840 

GAAGAGGCCA ATAAAGGAGA GAACACCAGC TTGTTACACC CTGTGAGCCT GCATGGAATG 900 
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GATGACCCTG AGAGAGAAGT GTTAGAGTGG AGGTTTGACA GCCGCCTAGC ATTTCATCAC 960 

GTGGCCCGAG AGCTGCATCC GGAGTACTTC AAGAACTGCA CTAGTGAGCC AGTAGATCCT 1020 

AGACTAGAGC CCTGGAAGCA TCCAGGAAGT CAGCCTAAAA CTGCTTGTAC CAATTGCTAT 1080 

TGTAAAAAGT GTTGCTTTCA TTGCCAAGTT TGTTTCATAA CAAAAGCCTT AGGCATCTCC 1140 

TATGGCAGGA AGAAGCGGAG ACAGCGACGA AGACCTCCTC AAGGCAGTCA GACTCATCAA 1200 

GTTTCTCTAT CAAAGCAACC CACCTCCCAA TCCCGAGGGG ACCCGACAGG CCCGAAGGAA 1260 

ACTAGTGGCC ACCATCACCA TCACCATTAA 1290 

(2) INFORMATION FOR SEQ ID NO: 17: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 412 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 17: 



Cys 


Ser 


Ser 


His 


Ser 


Ser 


Asn 


Met 


Ala 


Asn 


Thr 


Gin 


Met 


Lys 


Ser 


Asp 


1 








5 










10 










15 




Lys 


lie 


He 


He 


Ala 


His 


Arg 


Gly 


Ala 


Ser 


Gly 


Tyr 


Leu 


Pro 


Glu 


His 








20 










25 










30 






Thr 


Leu 


Glu 


Ser 


Lys 


Ala 


Leu 


Ala 


Phe 


Ala 


Gin 


Gin 


Ala 


Asp 


Tyr 


Leu 






35 










40 










45 








Glu 


Gin 


Asp 


Leu 


Ala 


Met 


Thr 


Lys 


Asp 


Gly Arg 


Leu 


Val 


Val 


He 


His 




50 










55 










60 










Asp 


His 


Phe 


Leu 


Asp 


Gly 


Leu 


Thr 


Asp 


Val 


Ala 


Lys 


Lys 


Phe 


Pro 


His 


65 










70 










75 










80 


Arg 


His 


Arg 


Lys 


Asp 


Gly 


Arg 


Tyr 


Tyr 


Val 


He Asp 


Phe 


Thr 


Leu 


Lys 










85 










90 










95 




Glu 


lie 


Gin 


Ser 


Leu 


Glu 


Met 


Thr 


Giu 


Asn 


Phe 


Glu 


Thr 


Met 


Gly Gly 








100 










105 










110 






Lys 


Trp 


Ser 


Lys 


Ser 


Ser 


Val 


Val 


Gly Trp 


Pro 


Thr 


Val 


Arg 


Glu 


Arg 






115 










120 










125 








Met 


Arg 


Arg 


Ala 


Glu 


Pro 


Ala 


Ala 


Asp 


Gly Val 


Gly Ala 


Ala 


Ser 


Arg 




130 










135 










140 










Asp 


Leu 


Glu 


Lys 


His 


Gly 


Ala 


He 


Thr 


Ser 


Ser 


Asn 


Thr 


Ala 


Ala 


Thr 


145 










150 










155 










160 


Asn 


Ala 


Ala 


Cys 


Ala 


Trp 


Leu 


Glu 


Ala 


Gin 


Glu 


Glu 


Glu 


Glu 


Vai 


Gly 










165 










170 










175 




Phe 


Pro 


Val 


Thr 


Pro 


Gin 


Val 


Pro 


Leu 


Arg 


Pro 


Met 


Thr 


Tyr 


Lys 


Ala 








180 










185 










190 






Ala 


Vai 


Asp 


Leu 


Ser 


His 


Phe 


Leu 


Lys 


Glu 


Lys 


Gly 


Gly 


Leu Glu Gly 






195 










200 










205 








Leu 


He 


His 


Ser 


Gin 


Arg 


Arg Gin Asp 


He 


Leu 


Asp 


Leu 


Trp 


lie 


Tyr 




210 










215 










220 










His 


Thr 


Gin 


Gly 


Tyr 


Phe 


Pro 


Asp 


Trp 


Gin 


Asn 


Tyr 


Thr 


Pro 


Gly 


Pro 


225 










230 










235 










240 


Gly Vai 


Arg 


Tyr 


Pro 


Leu 


Thr 


Phe 


Gly Trp 


Cys 


Tyr 


Lys 


Leu 


Val 


Pro 










245 










250 










255 




Val 


Giu 


Pro 


Asp 


Lys 


Val 


Glu 


Glu 


Ala 


Asn 


Lys 


Gly Glu Asn Thr 


Ser 








260 










265 










270 






Leu 


Leu 


His 


Pro 


Val 


Ser 


Leu 


His 


Gly Met 


Asp 


Asp 


Pro 


Glu Arg 


Glu 






275 










280 










285 








Val 


Leu 


Glu 


Trp 


Arg 


Phe 


Asp 


Ser 


Arg 


Leu 


Ala 


Phe 


His 


His 


Val 


Ala 



290 295 300 
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Arg 


Glu 


Leu 


His 


Pro 


Glu 


Tyr 


Phe 


Lys 


Asn 


Cys 


Thr 


Ser 


Glu 


Pro 


Val 


305 










310 










315 










320 


Asp 


Pro 


Arg 


Leu 


Glu 
325 


Pro 


Trp 


Lys 


His 


Pro 
330 


Gly 


Ser 


Gin 


Pro 


Lys 
335 


Thr 


Ala 


Cys 


Thr 


Asn 
340 


Cys 


Tyr 


Cys 


Lys 


Lys 
345 


Cys 


Cys 


Phe 


His 


Cys 
350 


Gin 


Val 


Cys 


Phe 


He 
355 


Thr 


Lys 


Ala 


Leu 


Gly 
360 


He 


Ser 


Tyr 


Gly 


Arg 
365 


Lys 


Lys 


Arg 


Arg 


Gin 
370 


Arg 


Arg 


Arg 


Pro 


Pro 
375 


Gin 


Gly 


Ser 


Gin 


Thr 
380 


His 


Gin 


Val 


Ser 


Leu 


Ser 


Lys 


Gin 


Pro 


Thr 


Ser 


Gin 


Ser 


Arg 


Gly 


Asp 


Pro 


Thr Gly 


Pro 


385 










390 










395 










400 


Lys 


Glu 


Thr 


Ser 


Gly 


His 


His 


His 


His 


His 


His 













405 410 



(2) INFORMATION FOR SEQ ID NO; 18: 

(i) SEQUENCE CHARACTERISTICS; 

(A) LENGTH: 981 base pairs 

(B) TYPE; nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY; linear 



(xi) SEQUENCE DESCRIPTION; SEQ ID NO: 18; 

ATGGATCCAA GCAGCCATTC ATCAAATATG 
ATTATTGCTC ACCGTGGTGC TAGCGGTTAT 
CTTGCGTTTG CACAACAGGC TGATTATTTA 
CGTTTAGTGG TTATTCACGA TCACTTTTTA 
CCACATCGTC ATCGTAAAGA TGGCCGTTAC 
CAAAGTTTAG AAATGACAGA AAACTTTGAA 
GTGGTTGGAT GGCCTACTGT AAGGGAAAGA 
GTGGGAGCAG CATCTCGAGA CCTGGAAAAA 
GCTACCAATG CTGCTTGTGC CTGGCTAGAA 
GTCACACCTC AGGTACCTTT AAGACCAATG 
TTTTTAAAAG AAAAGGGGGG ACTGGAAGGG 
CTTGATCTGT GGATCTACCA CACACAAGGC 
GGGCCAGGGG TCAGATATCC ACTGACCTTT 
CCAGATAAGG TAGAAGAGGC CAATAAAGGA 
CTGCATGGAA TGGATGACCC TGAGAGAGAA 
GCATTTCATC ACGTGGCCCG AGAGCTGCAT 
CACCATCACC ATCACCATTA A 

(2) INFORMATION FOR SEQ ID NO 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 327 amino acids 

(B) TYPE: amino acid 
<C) STRANDEDNESS: single 
(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 19: 

Met Asp Pro Ser Ser His Ser Ser Asn Met Ala Asn Thr Gin Met Lys 
15 10 15 



GCGAATACCC AAATGAAATC AGACAAAATC 60 

TTACCAGAGC ATACGTTAGA ATCTAAAGCA 120 

GAGCAAGATT TAGCAATGAC TAAGGATGGT 180 

GATGGCTTGA CTGATGTTGC GAAAAAATTC 240 

TATGTCATCG ACTTTACCTT AAAAGAAATT 300 

ACCATGGGTG GCAAGTGGTC AAAAAGTAGT 360 

ATGAGACGAG CTGAGCCAGC AGCAGATGGG 420 

CATGGAGCAA TCACAAGTAG CAATACAGCA 480 

GCACAAGAGG AGGAGGAGGT GGGTTTTCCA 54 0 

ACTTACAAGG CAGCTGTAGA TCTTAGCCAC 600 

CTAATTCACT CCCAACGAAG ACAAGATATC 660 

TACTTCCCTG ATTGGCAGAA CTACACACCA 720 

GGATGGTGCT ACAAGCTAGT ACCAGTTGAG 780 

GAGAACACCA GCTTGTTACA CCCTGTGAGC 840 

GTGTTAGAGT GGAGGTTTGA CAGCCGCCTA 900 

CCGGAGTACT TCAAGAACTG CACTAGTGGC 960 

981 

:19: 



SUBSTITUTE SHEET (RULE 25) 



WO 99/16884 



11 / 15 



PCT/EP98/06040 





Ser 


Asp 


Lys 


He 


He 


He 


Ala 


His 


Arg 


Gly Ala 


Ser Gly 


Tyr 


Leu 


Pro 




















ZD 
















Glu 


His 


Thr 
35 


Leu 


Glu 


Ser 


Lys 


Ala 

ft u 


Leu 


Ala 


Phe 


Ala Gin 


Gin 


Ala 


Asp 




Tyr 


Leu 
50 


Glu 


Gin 


Asp 


Leu 


Ala 


Met 


Thr 


Lys 


Asp 


Gly Arg 
fin 


Leu 


Val 


Val 




He 


His 


Asp 


His 


Phe 


l-itr la 




uxy 


x>e u 


Thr Asp 


Val Ala 


Lys 


Lys 


Phe 




65 




















75 








80 




Pro 


His 


Arg 


mo 


Arg 

0 3 


Lys 


Asp 


vjxy 


Arg 


Tyr 
90 


Tyr 


Val He 


Asp 


Phe 
95 


Thr 




T on 




vj-LU. 


T1 A 

1UU 


PI n 


Ser 


Leu 


<aXU 




Thr 


Glu 


Asn Phe 


Glu 
110 


Thr 


Met 




uiy 


v3 x y 


ijy o 

1 T C 
11 J 


irp 


oer 


Lys 


Ser 


C r-. ~~ 


vax 


Val 


Gly 


Trp Pro 
125 


Thr 


Val 


Arg 




Glu 


A y*n 






Arg 


a i = 
rvxa 


pin 


Pro 


nia 


Ala Asp Gly Val 


Gly Ala Ala 






1 




















140 










Ser 


Arg 


Asp 




VJ-i. U 


Lys 


His 


Gly 


A 1 » 


lie 


Thr 


Ser Ser 


Asn 


Thr 


Ala 




j. i j 










X.3U 










155 








160 






Thr 




Ala 
•Mica 


fix a 


Cys 


Ala 


Trp 


Leu 


Glu 


Ala 


Gin Glu 


Glu 


Glu 


Glu 






















170 








175 






Val 


Gly 


Phe 


Pro 


Val 


Thr 


Pro 


Gin 


Val 


Pro 


Leu 


Arg Pro 


Met 


Thr 


Tyr 


Iff 








ion 
XoU 










185 








190 








ijys 




ciXa 


vax 


Asp 


Leu 


Ser 


His 


rfie 


Leu 


Lys 


Glu Lys 


Gly 


Gly 


Leu 


Q 
Jl 






1 OR 

X1?D 










200 








205 










oXU 


G1 \7 


Leu 


x xe 


HIS 


Ser 


Gin Arg 


Arg 


Gin Asp 


He Leu 


Asp 


Leu 


Trp 
















215 










220 












xyr 




i nr 


uxn 


Gly Tyr 


Phe 


Pro 


Asp 


Trp 


Gin .Asn 


Tyr 


Thr 


Pro 














230 










235 








240 


Q 


uiy 


IT x. u 


oxy 


v ax 


Arg 


Tyr 


Pro 


Leu 


inr 


Phe 
250 


Gly 


Trp Cys 


Tyr 


Lys 
255 


Leu 




Val 


Pro 


Val 


Glu 


Pro 


Asp 


Lys 


Val 


Glu 


Glu 


Ala 


Asn Lys 


Gly 


Glu 


Asn 










260 










265 








270 








Thr 


Ser 


Leu 


Leu 


His 


Pro 


Val 


Ser 


Leu 


His 


Gly Met Asp 


Asp 


Pro 


Glu 








275 










280 








285 










Arg 


Glu 


Val 


Leu 


Glu 


Trp Arg 


Phe 


Asp 


Ser 


Arg 


Leu Ala 


Phe 


His 


His 






290 










295 










300 










Val 


Ala 


Arg 


Glu 


Leu 


His 


Pro 


Glu 


Tyr 


Phe 


Lys 


Asn Cys 


Thr 


Ser 


Gly 




305 










310 










315 








320 




His 


His 


His 


His 


His 
325 


His 





















(2) INFORMATION FOR SEQ ID NO; 20; 

£i) SEQUENCE CHARACTERISTICS: 
(A) LENGTH; 1242 base pairs 
£B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION; SEQ ID NO: 20; 

ATGGATCCAA GCAGCCATTC ATCAAATATG GCGAATACCC AAATGAAATC AGACAAAATC 60 

ATTATTGCTC ACCGTGGTGC TAGCGGTTAT TTACCAGAGC ATACGTTAGA ATCTAAAGCA 120 

CTTGCGTTTG CACAACAGGC TGATTATTTA GAGCAAGATT TAGCAATGAC TAAGGATGGT 180 

CGTTTAGTGG TTATTCACGA TCACTTTTTA GATGGCTTGA CTGATGTTGC GAAAAAATTC 24 0 

CCACATCGTC ATCGTAAAGA TGGCCGTTAC TATGTCATCG ACTTTACCTT AAAAGAAATT 300 
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CAAAGTTTAG AAATGACAGA AAACTTTGAA ACCATGGGTG GCAAGTGGTC AAAAAGTAGT 360 

GTGGTTGGAT GGCCTACTGT AAGGGAAAGA ATGAGACGAG CTGAGCCAGC AGCAGATGGG 420 

GTGGGAGCAG CATCTCGAGA CCTGGAAAAA CATGGAGCAA TCACAAGTAG CAATACAGCA 480 

GCTACCAATG CTGCTTGTGC CTGGCTAGAA GCACAAGAGG AGGAGGAGGT GGGTTTTCCA 540 

GTCACACCTC AGGTACCTTT AAGACCAATG ACTTACAAGG CAGCTGTAGA TCTTAGCCAC 600 

TTTTTAAAAG AAAAGGGGGG ACTGGAAGGG CTAATTCACT CCCAACGAAG ACAAGATATC 660 

CTTGATCTGT GGATCTACCA CACACAAGGC TACTTCCCTG ATTGGCAGAA CTACACACCA 720 

GGGCCAGGGG TCAGATATCC ACTGACCTTT GGATGGTGCT ACAAGCTAGT ACCAGTTGAG 7 80 

CCAGATAAGG TAGAAGAGGC CAATAAAGGA GAGAACACCA GCTTGTTACA CCCTGTGAGC 840 

CTGCATGGAA TGGATGACCC T GAG AG AG AA GTGTTAGAG? GGAGGTTTGA CAGCCGCCTA 900 

GCATTTCATC ACGTGGCCCG AGAGCTGCAT CCGGAGTACT TCAAGAACTG CACTAGTGAG 960 

CCAGTAGATC CTAGACTAGA GCCCTGGAAG CATCCAGGAA' GTCAGCCTAA AACTGCTTGT 1020 

ACCAATTGCT ATTGTAAAAA GTGTTGCTTT CATTGCCAAG TTTGTTTCAT AACAAAAGCC 1080 

TTAGGCATCT CCTATGGCAG GAAGAAGCGG AGACAGCGAC GAAGACCTCC TCAAGGCAGT 114 0 

CAGACTCATC AAGTTTCTCT ATCAAAGCAA CCCACCTCCC AAT CCCGAGG GGACCCGACA 1200 

GGCCCGAAGG AAACTAGTGG CCACCATCAC CATCACCATT AA 124 2 

(2) INFORMATION FOR SEQ ID NO: 21: 

(i) SEQUENCE CHARACTERISTICS: 
(A} LENGTH: 414 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 21: 



Met Asp 


Pro 


Ser 


Ser 


His 


Ser 


Ser 


Asn 


Met 


Ala 


Asn 


Thr 


Gin 


Met 


Lys 


1 






5 










10 










15 


Ser Asp 


Lys 


He 
20 


lie 


He 


Ala 


His 


Arg 
25 


Giy 


Ala 


Ser 


Gly 


Tyr 
30 


Leu 


Pro 


Glu His 


Thr 


Leu 


Glu 


Ser 


Lys 


Ala 


Leu 


Ala 


Phe 


Ala 


Gin 


Gin 


Ala 


Asp 




35 










40 










45 






Tyr Leu 


Glu 


Gin 


Asp 


Leu 


Ala 


Met 


Thr 


Lys 


Asp Gly Arg Leu Val 


Val 


50 










55 










60 










lie His 


Asp 


His 


Phe 


Leu 


Asp 


Gly 


Leu 


Thr 


Asp 


Val 


Ala 


Lys 


Lys 


Phe 


65 








70 










75 










80 


Pro His 


Arg 


His 


Arg 
85 


Lys 


Asp 


Gly 


Arg 


Tyr 
90 


Tyr 


Vai 


lie 


Asp 


Phe 
95 


Thr 


Leu Lys 


Glu 


He 
100 


Gin 


Ser 


Leu 


Glu 


Met 
105 


Thr 


Glu 


Asn 


Phe 


Glu 
110 


Thr 


Met 


Gly Gly 


Lys 
115 


Trp 


Ser 


Lys 


Ser 


Ser 
120 


Val 


Val 


Gly 


Trp 


Pro 
125 


Thr 


Val 


Arg 


Glu Arg 


Met 


Arg 


Arg 


Ala 


Glu 


Pro 


Ala 


Ala 


Asp Gly Val Gly Ala Ala 


130 










135 










140 










Ser Arg 


Asp 


Leu 


Glu 


Lys 


His 


Gly 


Ala 


He 


Thr 


Ser 


Ser 


Asn 


Thr 


Ala 


145 








150 










155 










160 


Ala Thr 


Asn 


Ala 


Ala 
165 


Cys 


Ala 


Trp 


Leu 


Glu 
170 


Ala 


Gin 


Glu 


Glu 


Glu 
175 


Glu 


Val Gly 


Phe 


Pro 


Val 


Thr 


Pro 


Gin 


Val 


Pro 


Leu 


Arg 


Pro 


Met 


Thr 


Tyr 






180 










185 










190 




Lys Ala 


Ala 


Val 


Asp 


Leu 


Ser 


His 


Phe 


Leu 


Lys 


Glu 


Lys 


Gly Gly Leu 




195 










200 










205 








Glu Gly 


Leu 


He 


His 


Ser 


Gin 


Arg 


Arg 


Gin 


Asp 


He 


Leu Asp 


Leu 


Trp 


210 










215 










220 










He Tyr 


His 


Thr 


Gin 


Gly 


Tyr 


Phe 


Pro 


Asp 


Trp 


Gin 


Asn 


Tyr 


Thr 


Pro 
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225 230 235 240 



Gly 


Pro 


Gly 


Val 


Arg Tyr 


Pro 


Leu 


Thr 


Phe 


Gly 


Trp 


Cys 


Tyr 


Lys 


Leu 










245 










250 










255 




Val 


Pro 


Val 


Glu 


Pro Asp 


Lys 


Val 


Glu 


Glu 


Ala 


Asn 


Lys 


Gly 


Glu 


Asn 








260 










265 










270 






Thr 


Ser 


Leu 


Leu 


His 


Pro 


Val 


Ser 


Leu 


His 


Gly Met 


Asp 


Asp 


Pro 


Glu 






275 










280 










285 








Arg 


Glu 
290 


Val 


Leu 


Glu 


Trp 


Arg 
295 


Phe 


Asp 


Ser 


Arg 


Leu 
300 


Ala 


Phe 


His 


His 


Val 


Ala 


Arg 


Glu 


Leu 


His 


Pro 


Glu 


Tyr 


Phe 


Lys 


Asn 


Cys 


Thr 


Ser 


Glu 


305 










310 










315 










320 


Pro 


Val 


Asp 


Pro 


Arg 
325 


Leu 


Glu 


Pro 


Trp 


Lys 
330 


His 


Pro 


Gly 


Ser 


Gin 
335 


Pro 


Lys 


Thr 


Ala 


Cys 
340 


Thr 


Asn 


Cys 


Tyr 


Cys 
345 


Lys 


Lys 


Cys 


Cys 


Phe 
350 


His 


Cys 


Gin 


Val 


Cys 


Phe 


lie 


Thr 


Lys 


Ala 


Leu 


Gly 


He 


Ser 


Tyr 


Gly Arg Lys 






355 










360 










365 








Lys 


Arg 


Arg 


Gin 


Arg 


Arg 


Arg 


Pro 


Pro 


Gin Gly 


Ser 


Gin 


Thr 


His 


Gin 




370 










375 










380 










Val 


Ser 


Leu 


Ser 


Lys 


Gin 


Pro 


Thr 


Ser 


Gin 


Ser 


Arg 


Gly Asp 


Pro 


Thr 


385 










390 










395 










400 


Gly 


Pro 


Lys 


Glu 


Thr 


Ser 


Gly 


His 


His 


His 


His 


His 


His 









405 410 



{2) INFORMATION FOR SEQ ID NO: 22: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 288 base pairs 

(B) TYPE: nucleic acid 

(C) S T RAN DEDNES S : single 

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 22: 

ATGGAGCCAG TAGATCCTAG ACTAGAGCCC TGGAAGCATC CAGGAAGTCA GCCTAAAACT 60 

GCTTGTACCA ATTGCTATTG TAAAAAGTGT TGCTTTCATT GCCAAGTTTG TTTCATAACA 120 

GCTGCCTTAG GCATCTCCTA TGGCAGGAAG AAGCGGAGAC AGCGACGAAG ACCTCCTCAA 180 

GGCAGTCAGA CTCATCAAGT TTCTCTATCA AAGCAACCCA CCTCCCAATC CAAAGGGGAG 240 

CCGACAGGCC CGAAGGAAAC TAGTGGCCAC CATCACCATC ACCATTAA 288 

(2) INFORMATION FOR SEQ ID NO: 23: 

(i) SEQUENCE CHARACTERISTICS: 
{A) LENGTH: 96 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 23: 

Met Glu Pro Val Asp Pro Arg Leu Glu Pro 

15 10 
Gin Pro Lys Thr Ala Cys Thr Asn Cys Tyr 

20 25 
His Cys Gin Val Cys Phe He Thr Ala Ala 



Trp Lys His Pro Gly Ser 
15 

Cys Lys Lys Cys Cys Phe 
30 

Leu Gly He Ser Tyr Gly 
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35 40 45 

Arg Lys Lys Arg Arg Gin Arg Arg Arg Pro Pro Gin Gly Ser Gin Thr 

50 55 60 

His Gin Val Ser Leu Ser Lys Gin Pro Thr Ser Gin Ser Lys Gly Glu 
65 70 75 80 

Pro Thr Gly Pro Lys Glu Thr Ser Gly His His His His His His 
85 90 95 

(2) INFORMATION FOR SEQ ID NO: 24: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 909 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO:24: 

ATGGGTGGCA AGTGGTCAAA AAGTAGTGTG GTTGGATGGC CTACTGTAAG GGAAAGAATG 60 

AGACGAGCTG AGCCAGCAGC AGATGGGGTG GGAGCAGCAT CTCGAGACCT GGAAAAACAT 120 

GGAGCAATCA CAAGTAGCAA TACAGCAGCT ACCAATGCTG CTTGTGCCTG GCTAGAAGCA 180 

CAAGAGGAGG AGGAGGTGGG TTTTCCAGTC ACACCTCAGG TACCTTTAAG ACCAATGACT 240 

TACAAGGCAG CTGTAGATCT TAGCCACTTT TTAAAAGAAA AGGGGGGACT GGAAGGGCTA 300 

ATTCACTCCC AACGAAGACA AGATATCCTT GATCTGTGGA TCTACCACAC ACAAGGCTAC 360 

TTCCCTGATT GGCAGAACTA CACACCAGGG CCAGGGGTCA GATATCCACT GACCTTTGGA 420 

TGGTGCTACA AGCTAGTACC AGTTGAGCCA GATAAGGTAG AAGAGGCCAA TAAAGGAGAG 480 

AACACCAGCT TGTTACACCC TGTGAGCCTG CATGGAATGG ATGACCCTGA GAGAGAAGTG 54 0 

TTAGAGTGGA GGTTTGACAG CCGCCTAGCA TTTCATCACG TGGCCCGAGA GCTGCATCCG 600 

GAGTACTTCA AGAACTGCAC TAGTGAGCCA GTAGATCCTA GACTAGAGCC CTGGAAGCAT 660 

CCAGGAAGTC AGCCTAAAAC TGCTTGTACC AATTGCTATT GTAAAAAGTG TTGCTTTCAT 720 

TGCCAAGTTT GTTTCATAAC AGCTGCCTTA GGCATCTCCT ATGGCAGGAA GAAGCGGAGA 780 

CAGCGACGAA GACCTCCTCA AGGCAGTCAG ACTCATCAAG TTTCTCTATC AAAGCAACCC 840 

ACCTCCCAAT CCAAAGGGGA GCCGACAGGC CCGAAGGAAA CTAGTGGCCA CCATCACCAT 900 

CACCATTAA 909 

{2) INFORMATION FOR SEQ ID NO: 25: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 303 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 25: 

Met Gly Gly Lys Trp Ser Lys Ser Ser Val Val Gly Trp Pro Thr Val 

1 5 10 15 

Arg Glu Arg Met Arg Arg Ala Glu Pro Ala Ala Asp Gly Val Gly Ala 

20 25 30 

Ala Ser Arg Asp Leu Glu Lys His Gly Ala lie Thr Ser Ser Asn Thr 

35 40 45 

Ala Ala Thr Asn Ala Ala Cys Ala Trp Leu Glu Ala Gin Glu Glu Glu 

50 55 60 

Glu Val Gly Phe Pro Val Thr Pro Gin Val Pro Leu Arg Pro Met Thr 
65 70 75 80 
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Tyr 


Lys 


Ala 


Ala 


Val 
85 


Asp 


Leu 


Ser 


His 


Phe 
90 


Leu 


Lys 


Glu 


Lys 


Gly 
95 


Gly 


Leu 


Glu Gly 


Leu 


He 


His 


Ser 


Gin 


Arg Arg 


Gin Asp 


lie 


Leu Asp 


Leu 








100 










105 










110 






Trp 


lie 


Tyr 
115 


His 


Thr 


Gin 


Gly 


Tyr 
120 


Phe 


Pro 


Asp 


Trp 


Gin 
125 


Asn 


Tyr 


Thr 


Pro 


Gly 
130 


Pro 


Gly 


Val 


Arg 


Tyr 
135 


Pro 


Leu 


Thr 


Phe 


Gly 
140 


Trp 


Cys 


Tyr 


Lys 


Leu 


Val 


Pro 


Val 


Glu 


Pro 


Asp 


Lys 


Val 


Glu 


Glu 


Ala 


Asn 


Lys 


Gly 


Glu 


145 










150 










155 










160 


Asn 


Thr 


Ser 


Leu 


Leu 


His 


Pro 


Val 


Ser 


Leu 


His 


Gly 


Met 


Asp Asp 


Pro 










165 










170 










175 




Glu 


Arg 


Glu 


Val 


Leu 


Glu 


Trp 


Arg 


Phe 


Asp 


Ser Arg 


Leu 


Ala 


Phe 


His 








180 










185 










190 






His 


Val 


Ala 
195 


Arg 


Glu 


Leu 


His 


Pro 
200 


Glu 


Tyr 


Phe 


Lys 


Asn 
205 


Cys 


Thr 


Ser 


Glu 


Pro 
210 


Val 


Asp 


Pro 


Arg 


Leu 
215 


Glu 


Pro 


Trp 


Lys 


His 
220 


Pro 


Gly 


Ser 


Gin 


Pro 


Lys 


Thr 


Ala 


Cys 


Thr 


Asn 


Cys 


Tyr 


Cys 


Lys 


Lys 


Cys 


Cys 


Phe 


His 


225 










230 










235 










240 


Cys 


Gin 


Val 


Cys 


Phe 
245 


He 


Thr 


Ala 


Ala 


Leu 
250 


Gly 


He 


Ser 


Tyr 


Gly 
255 


Arg 


Lys 


Lys 


Arg 


Arg 


Gin Arg Arg Arg 


Pro 


Pro Gin Gly 


Ser 


Gin 


Thr 


His 








260 










265 










270 






Gin 


Val 


Ser 
275 


Leu 


Ser 


Lys 


Gin 


Pro 
280 


Thr 


Ser 


Gin 


Ser 


Lys 
285 


Gly 


Glu 


Pro 


Thr 


Gly 


Pro 


Lys 


Glu 


Thr 


Ser 


Gly 


His 


His 


His 


His 


His 


His 







290 295 300 



(2) INFORMATION FOR SEQ ID NO; 26; 

(i) SEQUENCE CHARACTERISTICS: 
(A) LENGTH: 57 base pairs 
{B) TYPE: nucleic acid 

(C) STRANDEDNESS ; single 

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 26: 

TTCGAAACCA TGGCCGCGGA CTAGTGGCCA CCATCACCAT CACCATTAAC GGAATTC 
(2) INFORMATION FOR SEQ ID NO: 27: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 17 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 27: 

Thr Ser Gly His His His His His His 
1 5 
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B/ Coding sequence of the first 127 amino acids 
of protein D and multiple cloning site. The signal 
sequence is underlined. 

BamHI 

ATG GAT CCA AAA ACT TTA GCC CTT TCT TTA TT A GCA OCT GGC GTA CTA GCA GGT TGT AGC AGC 

Met Asp Pro Lys Thr Leu Ala Leu Ser Leu Leu Ala Ala Gly Val Leu Ala Gly Cys Ser Ser 

CAT TCA TCA AAT ATG GCG AAT ACC CAA ATG AAA TCA G AC AAA ATC ATT ATT GCT CAC CGT GGT 

His Ser Ser Asn Met Ala Asti Thr Gin Met Lys Ser Asp Lys He He He Ala His Arg Gly 

GCT AGC GGT TAT TTA CCA GAG CAT ACG TTA GAA TCT AAA GCA CTT GCT TTT GCA CAA CAG GCT 

Ala Ser Gly Tyr Leu Pro Glu His Thr Leu Glu Ser Lys Ala Leu Ala Phe Ala Gin Gin Ala 

GAT TAT TTA GAG CAA GAT TTA GCA ATG ACT AAG GAT GGT CGT TTA GTG GTT ATT CAC GAT CAC 

Asp Tyr Leu Glu Gin Asp Leu Ala Met Thr Lys Asp Gly Arg Leu Val Val lie His Asp His 

TTT TTA GAT GGC TTG ACT GAT GTT GCG AAA AAA TTC CCA CAT CGT CAT CGT AAA GAT GGC CGT 

Phe Leu Asp Gly Leu Thr Asp Val Ala Lys Lys Phe Pro His Arg His Arg Lys Asp Gly Arg 

TAC TAT GTC ATC G AC TTT ACC TTA AAA GAA ATT GAA AGT TTA GAA ATG AC A GAA AAC TTT GAA 

Tyr Tyr Vkl He Asp Phe Thr Leu Lys Glu He Gin Ser Leu Glu Met Thr Glu Asn Phe Glu 

Nc °I Spel Xbal 

ACC ATG GCC ACG TGT GAT CAG AGC TCA ACT AGT GGA CAC CAT CAC CAT CAC CAT TAA TCTATjA 

Thr Mel Ala Thr Cys Asp Gin Ser Ser Thr Ser Gly His His His His His His * 



The amino acid sequence of Figure 1 relates to Seq, ID no. 7 and the nucleic acid sequence of 
Figure 1 relates to Seq. ID. No. 6. 
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The DNA and amino acid sequences of Nef-His; Tat-His; Nef-Tat-His fusion and 
mutated Tat is illustrated. 

Pichia-expressed constructs (plain constructs) 
=> Nef-HIS 

DNA sequence (Seq. ID. No. 8) 

ATGGGTGGCAAGTGGTCAAAAAGTAGTGTGGTTGGATGGCCTACTGTAAGGGAAAGA 
ATGAGACGAGCTGAGCCAGCAGCAGATGGGGTGGGAGCAGCATCTCGAGACCTGGAA 
AAACATGGAGCAATCAGAAGTAGCAATACAGCAGCTACCAATGCTGCTTGTGCCTGG 
CTAGAAGCACAAGAGGAGGAGGAGGTGGGTTTTCCAGTCACACCTCAGGTACCTTTA 
AGACCAATGACTTACAAGGCAGCTGTAGATCTTAGCCACTTTTTAAAAGAAAAGGGG 
GGACTGGAAGGGCTAATTCACTCCCAACGAAGACAAGATATCCTTGATCTGTGGATC 
TACCACACACAAGGCTACTTCCCTGATTGGCAGAACTACACACCAGGGCCAGGGGTC 
AGATAT CCACTGACCTTTGGATGGTGCTACAAGCTAGTACCAGTTGAGC CAGATAAG 
GTAGAAGAGGCCAATAAAGGAGAGAACACCAGCTTGTTACACCCTGTGAGCCTGCAT 
GGAATGGATGACCCTGAGAGAGAAGTGTTAGAGTGGAGGTTTGACAGCCGCCTAGCA 
TTTCATCACGTGGCCCGAGAGCTGCATCCGGAGTACTTCAAGAACTGCACTAGTGGC 
CACCATCACCATCACCATTAA 

Protein sequence(Seq. ID. No. 9) 



MGGKWS KS S WGWPTVRERMRRAE P AADGVGAASRDLEKHGAI T S SNTAATNAACAW 
LEAQEEEEVGFPVTPQVPLRPMTYKAAVDLSHFLKEKGGLEGLIHSQRRQDILDLWI 
YHTQGYFPDWQNYTPGPGVRYPLTFGWCYKLVPVEPDKVEEANKGENTSLLHPVSLH 
GMDDPEREVLEWRFDSRIlAFHHVARELHPEYFKNCTSGHHHHHH . 



=> Tat -HIS 

DNA sequence (Seq. ID. No. 10) 



ATGGAGCCAGTAGATCCTAGACTAGAGCCCTGGAAGCATCCAGGAAGTCAGCCTAAA 
ACTGCTTGTACCAATTGCTATTGTAAAAAGTGTTGCTTTCATTGCCAAGTTTGTTTC 
ATAACAAAAGCCTTAGGCATCTCCTATGGCAGGAAGAAGCGGAGACAGCGACGAAGA 
CCTCCTCAAGGCAGTCAGACTCATCAAGTTTCTCTATCAAAGCAACCCACCTCCCAA 
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TCCCGAGGGGACCCGACAGGCCCGAAGGAAACTAGTGGCCACCATCACCATCACCAT 
TAA 

Protein sequence (Seq. ID. No. 11) 

MEPVDPRLEPWKHPGSQPKTACTNCYCKKCCFHCQVCF ITKALGI S YGRKKRRQRRR 
PPQGSQTHQVSLSKQPTSQSRGDPTGPKETSGHHHHHH. 



=> Nef-Tat-HIS 

DNA sequence (Seq. ID. No. 12) 

ATGGGTGGCAAGTGGTCAAAAAGTAGTGTGGTTGGATGGCCTACTGTAAGGGAAAGA 

ATGAGACGAG CTGAG C CAGCAGCAG ATGGGGTGGGAGC AGC AT CT CGAGAC CTGGAA 

AAACATGGAGCAATCACAAGTAGCAATACAGCAGCTACCAATGCTGCTTGTGCCTGG 

CTAGAAGCACAAGAGGAGGAGGAGGTGGGTTTTCCAGTCACACCTCAGGTACCTTTA 

AGACCAATGACTTACAAGGCAGCTGTAGATCTTAGCCACTTTTTAAAAGAAAAGGGG 

GGACTGGAAGGGCTAATTCACTCCCAACGAAGACAAGATATCCTTGATCTGTGGATC 

TACCACACACAAGGCTACTTCCCTGATTGGCAGAACTACACACCAGGGCCAGGGGTC 

AGATATCCACTGACCTTTGGATGGTGCTACAAGCTAGTACCAGTTGAGCCAGATAAG 

GTAGAAGAGGCCAATAAAGGAGAGAACACCAGCTTGTTACACCCTGTGAGCCTGCAT 

GGAATGGATGACCCTGAGAGAGAAGTGTTAGAGTGGAGGTTTGACAGCCGCCTAGCA 

TTTCATCACGTGGCCCGAGAGCTGCATCCGGAGTACTTCAAGAACTGCACTAGTGAG 

CCAGTAGATCCTAGACTAGAGCCCTGGAAGCATCCAGGAAGTCAGCCTAAAACTGCT 

TGTACCAATTGCTATTGTAAAAAGTGTTGCTTTCATTGCCAAGTTTGTTTCATAACA 

AAAGCCTTAGGCATCTCCTATGGCAGGAAGAAGCGGAGACAGCGACGAAGACCTCCT 

CAAGGCAGTCAGACTCATCAAGTTTCTCTATCAAAGCAACCCACCTCCCAATCCCGA 

GGGGACCCGACAGGCCCGAAGGAAACTAGTGGCCACCATCACCATCACCATTAA 

Protein sequence(Seq. ID. No. 13) 



MGGKWS KS S WGWPTVRERMRRAE P AADGVGAAS RDLE KHGAI T S SNTAATNAACAW 
LEAQEEEEVGFPVTPQVPLRPMTYKAAVDLSHFLKEKGGLEGLIHSQRRQDILDLWI 
YHTQGYFPDWQNYTPGPGVRYPLTFGWCYKLVPVEPDKVEEANKGENTSLLHPVSLH 
GMDDPEREVLEWRFDSRLAFHHVARELHPEYFKNCTSEPVDPRLEPWKHPGSQPKTA 
CTNCYCKKCCFHCQVCFITKALGISYGRKKRRQRRRPPQGSQTHQVSLSKQPTSQSR 
GDPTGPKETSGHHHHHH. 



E.coli-expressed constructs (fusion constructs) 



z=> LipoD-Nef-HIS 
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Nucleotides corresponding to the Prot D Fusion Partner are in bold. 
The Lipidation Signal Sequence is underlined. After processing, the cysteine 
coded by the TGT codon, indicated with a star, becomes the amino terminal 
residue which is then modified by covalently bound fatty acids. 

ATGGATCCAAAAACTTTAGCCCTTTCTTTATTAGCAGCTGGCGTACTAGCAGGT TGT 
AGC AGC CATT C ATCAAATATGG C GAATAC C C AAATGAAAT C AGAC AAAATCATTATT 
GCTCACCGTGGTGCTAGCGGTTATTTACCAGAGCATACGTTAGAATCTAAAGCACTT 
GCTTTTGCACAACAGGCTGATTATTTAGAGCAAGATTTAGCAATGACTAAGGATGGT 
CGTTTAGTGGTTATTCACGATCACTTTTTAGATGGCTTGACTGATGTTGCGAAAAAA 
TTCCCACATCGTCATCGTAAAGATGGCCGTTACTATGTCATCGACTTTACCTTAAAA 
GAAATTCAAAGTTTAGAAATGACAGAAAACTTTGAAACCATGGGTGGCAAGTGGTCA 
AAAAGTAGTGTGGTTGGATGGCCTACTGTAAGGGAAAGAATGAGACGAGCTGAGCCA 
GCAGCAGATGGGGTGGGAGCAG CAT CT CGAGACCTGGAAAAAC ATGGAGCAATGAC A 
AGT AGC AATAC AGC AGCT AC CAATG CTG CTTGTGCC TGGCT AGAAG C ACAAGAGGAG 
GAGGAGGTGGGTTTTCCAGTCACACCTCAGGTACCTTTAAGACCAATGACTTACAAG 
GCAGCTGTAGATCTTAGCCACTTTTTAAAAGAAAAGGGGGGACTGGAAGGGCTAATT 
CACTCCCAACGAAGACAAGATATCCTTGATCTGTGGATCTACCACACACAAGGCTAC 
TTCCCTGATTGGCAGAACTACACACCAGGGCCAGGGGTCAGATATCCACTGACCTTT 
GGATGGTGCTACAAGCTAGTACCAGTTGAGCCAGATAAGGTAGAAGAGGCCAATAAA 
GGAGAGAACAC CAG CTTGTT AC AC CCTGTGAGC CTGCATGGAATGGATGAC C CTGAG 
AGAGAAGTGTTAGAGTGGAGGTTTGACAGCCGCCTAGCATTTCATCACGTGGCCCGA 
GAGCTGCATC CGGAGTACTTCAAGAACTGCACTAGTGGC CAC CATC AC CATCACCAT 
TAA 

Protein sequence of the processed lipidated ProtD-NefHIS protein (Seq. ID. No. 15) 
(Amino-acids corresponding to Prot D fusion partner are in bold) 



CSSHSSNMANTQMKSDKIIIAHRGASGYLPEHTLESKALAFAQQADYLEQDLAMTKD 
GRLWIHDHFLDGLTDVAKKFPHRHRKDGRYYVIDFTLKEIQSLEMTENFETMGGKW 

S KS S WGWPTVRERMRRAEPAADGVGAASRDLEKHGA I T S SNT AATNAACAWLE AQE 
EEEVGFPVTPQVPLRPMTYKAAVDLSHFLKEKGGLEGLIHSQRRQDILDLWIYHTQG 
YF P D WQN YT PGPG VR YPLTFGW C YKL VP VE PDKVEEANKGENTS LLHP VS LHGMDD P 
EREVLEWRFDSRLAFHHVARELHPEYFKNCTSGHHHHHH . 



LipoD-Nef-Tat-HIS 

DNA sequence (Seq. ID. No. 16) 
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Nucleotides corresponding to the Prot D Fusion Partner are in bold. 
The Lipidation Signal Sequence is underlined. After processing, the cysteine 
coded by the TGT codon, indicated with a star, becomes the amino terminal 
residue which is then modified by covaiently bound fatty acids. 

* 

ATGGATCCAAAAACTTTAGCCCTTTCTTTATTAGCAGCTGGCGTACTAGCAGGT TGT 
AGCAGCC ATT C AT C AAATATGGC GAATAC C CAAATGAAATC AGAC AAAATC ATTATT 
GCTCACCGTGGTGCTAGCGGTTATTTACCAGAGCATACGTTAGAATCTAAAGCACTT 
GCGTTTGCACAACAGGCTGATTATTTAGAGCAAGATTTAGCAATGACTAAGGATGGT 
CGTTTAGTGGTTATTCACGATCACTTTTTAGATGGCTTGACTGATGTTGCGAAAAAA 
TTCCCACATCGTCATCGTAAAGATGGCCGTTACTATGTCATCGACTTTACCTTAAAA 
GAAATTCAAAGTTTAGAAATGACAGAAAACTTTGAAACCATGGGTGGCAAGTGGTCA 
AAAAGT AGTGTGGTTGG ATGG C CTACTGT AAGGGAAAGAATGAGACGAG CTGAG C CA 
GCAGCAGATGGGGTGGGAGCAGCATCTCGAGACCTGGAAAAACATGGAGCAATCACA 
AGTAGCAATACAGCAGCTACCAATGCTGCTTGTGCCTGGCTAGAAGCACAAGAGGAG 
GAGGAGGTGGGTTTTCCAGTCACACCTCAGGTACCTTTAAGACCAATGACTTACAAG 
GCAGCTGTAGATCTTAGCCACTTTTTAAAAGAAAAGGGGGGACTGGAAGGGCTAATT 
CACTCCCAACGAAGACAAGATATCCTTGATCTGTGGATCTACCACACACAAGGCTAC 
TTCCCTGATTGGCAGAACTACACACCAGGGCCAGGGGTCAGATATCCACTGACCTTT 
GGATGGTGCTACAAGCTAGTACCAGTTGAGCCAGATAAGGTAGAAGAGGCCAATAAA 
GGAGAGAACACCAGCTTGTTACACCCTGTGAGCCTGCATGGAATGGATGACCCTGAG 
AGAGAAGTGTT AGAGTGGAGGTTTGACAGCCGC CTAG CATTT CAT CACGTGGC C CGA 
G AGCTGCATC CGG AGT ACTT CAAGAACTGCACTAGTGAGC CAGTAGATC CTAGACTA 
GAGCCCTGGAAGCATCCAGGAAGTCAGCCTAAAACTGCTTGTACCAATTGCTATTGT 
AAAAAGTGTTGCTTTCATTGCCAAGTTTGTTTCATAACAAAAGCCTTAGGCATCTCC 
TATGGCAGGAAGAAGCGGAGACAGCGACGAAGACCTCCTCAAGGCAGTCAGACTCAT 
CAAGTTTCTCTATCAAAGCAACCCACCTCCCAATCCCGAGGGGACCCGACAGGCCCG 
AAGGAAACTAGTGGCCACCATCACCATCACCATTAA 



Protein sequence of the processed lipidated ProtD-NEF-TA T-HIS protein (Seq. ID. No, 

m 

(Amino-acids corresponding to Prot D fusion partner are in bold) 

CSSHSSNMANTQMKSDKIIIAHRGASGYLPEHTLESKALAFAQQADYLEQDLAMTKD 
GRLWIHDHFLDGLTDVAKKFPHRHRKDGRYYVIDFTLKEIQSLEMTENFETMGGKW 

SKSSWGWPTVRERMRRAEPAADGVGAASRDLEKHGAITSSNTAATNAACAWLEAQE 
EEEVGFPVTPQVPLRPMTYKAAVDLSHFLKEKGGLEGLIHSQRRQDILDLWIYHTQG 
YFPDWQNYTPGPGVRYPLTFGWCYKLVPVEPDKVEEANKGENTSLLHPVSLHGMDDP 
EREVLEWRFDSRLAFHHVARELHPEYFKNCTSEPVDPRLEPWKHPGSQPKTACTNCY 
CKKCCFHCQVCF I TKALG I S YGRKKRRQRRRPPQGSQTHQVSLS KQ PTSQSRGD PTG 
PKETSGHHHHHH . 
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=> ProtD-Nef-HIS 

DNA sequence (Seq. ID. No. 18) 

Nucleotides corresponding to the Prot D Fusion Partner are in bold. 

ATGGATCCAAGCAGCCATTCATCAAATATGGCGAATACCCAAATGAAATCAGACAAA 

ATCATTATTGCTCACCGTGGTGCTAGCGGTTATTTACCAGAGCATACGTTAGAATCT 

AAAGCACTTGCGTTTGCACAACAGGCTGATTATTTAGAGCAAGATTTAGCAATGACT 

AAGGATGGTCGTTTAGTGGTTATTCACGATCACTTTTTAGATGGCTTGACTGATGTT 

GCGAAAAAATTCCCACATCGTCATCGTAAAGATGGCCGTTACTATGTCATCGACTTT 

ACCTTAAAAGAAATTCAAAGTTTAGAAATGACAGAAAACTTTGAAACCATGGGTGGC 

AAGTGGTCAAAAAGTAGTGTGGTTGGATGGCCTACTGTAAGGGAAAGAATGAGACGA 

GCTGAGCCAGCAGCAGATGGGGTGGGAGCAGCATCTCGAGACCTGGAAAAACATGGA 

GCAATCACAAGTAGCAATACAGCAGCTACCAATGCTGCTTGTGCCTGGCTAGAAGCA 

CAAGAGGAGGAGGAGGTGGGTTTTCCAGTCACACCTCAGGTACCTTTAAGACCAATG 

ACTTACAAGGCAGCTGTAGATCTTAGCCACTTTTTAAAAGAAAAGGGGGGACTGGAA 

GGGCTAATTCACTCCCAACGAAGACAAGATATCCTTGATCTGTGGATCTACCACACA 

CAAGGCTACTTCCCTGATTGGCAGAACTACACACCAGGGCCAGGGGTCAGATATCCA 

CTGACCTTTGGATGGTGCTACAAGCTAGTACCAGTTGAGCCAGATAAGGTAGAAGAG 

GCCAATAAAGGAGAGAACACCAGCTTGTTACACCCTGTGAGCCTGCATGGAATGGAT 

GACCCTGAGAGAGAAGTGTTAGAGTGGAGGTTTGACAGCCGCCTAGCATTTCATCAC 

GTGGCCCGAGAGCTGCATCCGGAGTACTTCAAGAACTGCACTAGTGGCCACCATCAC 
CATCACCATTAA 



Protein sequence (Seq. ID. No. 19) 

(Ammo-acids corresponding to Prot D fusion partner are in bold) 

MDPSSHSSNMANTQMKSDKIIIAHRGASGYLPEHTLESKALAFAQQADYL 
EQDLAMTKDGRLWIHDHFLDGLTDVAKKFPHRHRKDGRYYVIDFTLK 

EIQSLEMTENFETMGGKWSKSSVVGWPTVRERMRRAEPAADGVGAASRDL 
EKHGAITSSNTAATNAACAWLEAQEEEEVGFPVTPQVPLRPMTYKAAVDLSH 
FLKEKGGLEGLIHSQRRQDILDLWIYHTQGYFPDWQNYTPGPGVRYPLTFGW 
CYKLVPVEPDKVEEANKGENTSLLHPVSLHGMDDPEREVLEWRFDSRLAFH 
HVARELHPEYFKNCTSGHHHHHH . 



=> ProtD-Nef-Tat-HIS 

DNA sequence (Seq. ID. No. 20) 
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Nucleotides corresponding to the Prot D Fusion Partner are in bold. 

ATGGATCCAAGCAGCCATTCATCAAATATGGCGAATACCCAAATGAAATCAGACAAA 
ATCATTATTGCTCACCGTGGTGCTAGCGGTTATTTACCAGAGCATACGTTAGAATCT 
AAAGCACTTGCGTTTGCACAACAGGCTGATTATTTAGAGCAAGATTTAGCAATGACT 
AAGGATGGTCGTTTAGTGGTTATTCACGATCACTTTTTAGATGGCTTGACTGATGTT 
GCGAAAAAATTCCCACATCGTCATCGTAAAGATGGCCGTTACTATGTCATCGACTTT 
ACCTTAAAAGAAATTCAAAGTTTAGAAATGACAGAAAACTTTGAAACCATGGGTGGC 
AAGTGGTCAAAAAGTAGTGTGGTTGGATGGCCTACTGTAAGGGAAAGAATGAGACGA 
GCTGAGCCAGCAGCAGATGGGGTGGGAGCAGCATCTCGAGACCTGGAAAAACATGGA 
GCAATCACAAGTAGCAATACAGCAGCTACCAATGCTGCTTGTGCCTGGCTAGAAGCA 
CAAGAGGAGGAGGAGGTGGGTTTT C CAGT CACAC CT CAGGT ACCTTTAAGAC CAATG 
ACTTACAAGGCAGCTGTAGATCTTAGCCACTTTTTAAAAGAAAAGGGGGGACTGGAA 
GGGCTAATTCACTCCCAACGAAGACAAGATATCCTTGATCTGTGGATCTACCACACA 
CAAGGCTACTTCCCTGATTGGCAGAACTACACACCAGGGCCAGGGGTCAGATATCCA 
CTGACCTTTGGATGGTGCTACAAGCTAGTACCAGTTGAGCCAGATAAGGTAGAAGAG 
GCCAATAAAGGAGAGAACACCAGCTTGTTACACCCTGTGAGCCTGCATGGAATGGAT 
GACCCTGAGAGAGAAGTGTTAGAGTGGAGGTTTGACAGCCGCCTAGCATTTCATCAC 
GTGGCCCGAGAGCTGCATCCGGAGTACTTCAAGAACTGCACTAGTGAGCCAGTAGAT 
CCTAGACTAGAGCCCTGGAAGCATCCAGGAAGTCAGCCTAAAACTGCTTGTACCAAT 
TGCTATTGTAAAAAGTGTTGCTTTCATTGCCAAGTTTGTTTCATAACAAAAGCCTTA 
GGCATCTCCTATGGCAGGAAGAAGCGGAGACAGCGACGAAGACCTCCTCAAGGCAGT 
CAGACTCATCAAGTTTCTCTATCAAAGCAACCCACCTCCCAATCCCGAGGGGACCCG 
ACAGGC CCGAAGGAAACTAGTGGCCAC CATCACCATCAC CATTAA 



Protein sequence (Seq. ID. No. 21) 

(Amino-acids corresponding to Prot D fusion partner are in bold) 

MD P S SHS SNMANTQMKSDKI I IAHRGASGYLPEHTLESKALAFAQQAD YLEQDL AMT 
KDGRLWIHDHFLDGLTDVAKKPPHRHRKDGRYYVIDFTLKEIQSLEMTENFETMGG 

KWSKSSVVGWPTVRERMRRAEPAADGVGAASRDLEKHGAITSSNTAATNAACAWLEA 
QEEEEVGF PVTPQVPLRPMTYKAAVDLSHFLKE KGGLEGL IHSQRRQD I LDLW I YHT 
QGYFPDWQNYTPGPGVRYPLTFGWCYKLVPVEPDKVEEANKGENTSLLHPVSLHGMD 
DPEREVLEWRFDSRLAFHHVAREIjHPEYFKNCTSEPVDPRLEPWKHPGSQPKTACTN 
CYCKKCCFHCQVCFITKALGISYGRKKRRQRRRPPQGSQTHQVSLSKQPTSQSRGDP 
TGPKETSGHHHHHH . 



=> Tat-MUTANT-HIS 



DNA sequence (Seq. ID. No. 22) 
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ATGGAGC C AGTAGATC CTAGACT AGAGC CCTGGAAGCATC 4 0 
CAGGAAGTCAGCCTAAAACTGCTTGTACCAATTGCTATTG 8 0 
TAAAAAGTGTTGCTTTCATTGCCAAGTTTGTTTCATAACA 120 
GCTGCCTTAGGCATCTCCTATGGCAGGAAGAAGCGGAGAC 16 0 
AGCG ACGAAGAC CT C CT CAAGG C AGT CAGACT C AT C AAGT 2 0 0 
TTCTCTATCAAAGCAACCCACCTCCCAATCCAAAGGGGAG 24 0 
CCGACAGGCCCGAAGGAAACTAGTGGCCAC CAT CACCAT C 2 80 
ACCATTAA 288 



Protein sequence(Seq. ID. No. 23) 

Mutated amino-acids in Tat sequences are in bold. 

MEPVDPRLEPWKHPGSQPKTACTNCYCKKCCFHCQVCFIT 4 0 
AALGISYGRKKRRQRRRPPQGSQTHQVSLSKQPTSQSKGE 8 0 

PTGPKETSGHHHHHH . 95 



=>Nef~Tat-Mutant-HIS 

DNA sequence(Seq. ID, No. 24) 

ATGGGTGGCAAGTGGTCAAAAAGTAGTGTGGTTGGATGGC 4 0 
CTACTGTAAGGGAAAGAATGAGACGAGCTGAGCCAGCAGC 8 0 
AGATGGGGTGGGAGCAGCATCTCGAGACCTGGAAAAACAT 120 
GGAGCAAT CACAAGTAGCAAT AC AGCAGCTAC CAATGCT G 160 
CTTGTGCCTGGCTAGAAGCACAAGAGGAGGAGGAGGTGGG 2 00 
TTTTCCAGTCACACCTCAGGTACCTTTAAGACCAATGACT 240 
TACAAGGCAGCTGTAGATCTTAG C CACTTTTT AAAAGAAA 280 
AGGGGGGACTGGAAGGGCTAATT C ACTC CCAACGAAGAC A 320 
AGATATCCTTGATCTGTGGATCTACCACACACAAGGCTAC 360 
TTCC CTGATTGGCAGAACTACACACCAGGGCCAGGGGTCA 400 
GATATCCACTGACCTTTGGATGGTGCTACAAGCTAGTACC 440 
AGTTGAGCCAGATAAGGTAGAAGAGGCCAATAAAGGAGAG 4 80 
AACACCAGCTTGTTACACCCTGTGAGCCTGCATGGAATGG 520 
ATGACCCTGAGAGAGAAGTGTTAGAGTGGAGGTTTGACAG 560 
CCGCCTAGCATTTCATCACGTGGCCCGAGAGCTGCATCCG 600 
GAGT ACTT C AAGAACTGCACT AGTG AGC C AGT AGAT C CT A 640 
G ACT AGAG C C CTGGAAG C AT CC AGGAAGT C AG CCT AAAAC 6 80 
TGCTTGTACCAATTGCTATTGTAAAAAGTGTTGCTTTCAT 720 
TGCCAAGTTTGTTTCATAACAGCTGCCTTAGGCATCTCCT 76 0 
AT GGCAGGAAGAAGCGGAGAC AG CGACGAAGACCT C CTC A 800 
AGGCAGTCAGACTCATCAAGTTTCTCTATCAAAGCAACCC 840 
ACCTC CC AATCC AAAGGGGAGCCGAC AGGC C CGAAGGAAA 880 
CTAGTGGCCAC CAT CAC CATC ACCATTAA 909 



09/50 923 » 

PCT/EP98/06040 

Protein sequence (Seq. ID. No. 25) 

Mutated amino-acids in Tat sequence are in bold. 

MGGKWSKSSWGWPTVRERMRRAEPAADGVGAASRDLEKH 4 0 

GAITS SNTAATNAACAWLEAQEEEEVGFPVTPQVPLRPMT 8 0 

YKAAVDLSHFLKEKGGLEGLIHSQRRQDILDLWIYHTQGY 12 0 

FPDWQNYTPGPGVRYPLTFGWCYKLVPVEPDKVEEANKGE 16 0 

NTSLLHPVSLHGMDDPEREVLEWRFDSRIjAFHHVARELHP 2 00 

EYFKNCTSEPVDPRLEPWKHPGSQPKTACTNCYCKKCCFH 24 0 

CQVCFITAALGISYGRKKRRQRRRPPQGSQTHQVSLSKQP 2 8 0 

T S Q SKGE PTGP KE TS GHHHHHH . 3 n 2 



WO 99/16884 

9/17 



WO 99/16884 



10/17 



09/509239 
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. 5 SDS-PAGE: Nef-Tat-his fusion protein 



1 2 3 4 5 6 7 




Coomassie blue G250 



1: MW (175/83/62,5/47,5/32,5/25/16,5/6,5 kDa) 

2: TNH/23 SP eluate (4 ug) 

3: TNH/23 Superdex200 eluate (4 ug) 

4: TNH/23 Purified bulk (4 ug) 

5: TNH/22 Purified bulk (4 ug) 

6: TNH/23 Purified bulk (4 ug) / non reducing conditions 
7: TNH/22 Purified bulk (4 ug) / non reducing conditions 
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. 8 Cell binding assay 
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Fig. 9 Inhibition of cell growth 
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